What are the train/test splits of T-Rex used in the paper?

Hi. Thank you for the great work.

I was wondering if you could provide the train/test splits of T-Rex used in the paper?
In your README file, there is a link to download the file `CHOLAN-EL-TREX.tsv`. But there is no indication of which line in the file belongs to the train set or the test set.

Furthermore, I counted the number of data lines in that file. There were 1,089,661 data lines (except the header line). However, your paper mentions that "the dataset has 983,257 sentences". So was the file the same data you used in your paper?

Thank you.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

What are the train/test splits of T-Rex used in the paper? #3

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

What are the train/test splits of T-Rex used in the paper? #3

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions