Hi. Thank you for the great work.
Could you provide the train/test splits of T-REx used in the paper?
Your README links to the file CHOLAN-EL-TREX.tsv, but nothing in it indicates which lines belong to the train set and which to the test set.
Furthermore, I counted the data lines in that file and found 1,089,661 (excluding the header line). However, your paper states that "the dataset has 983,257 sentences". Is this file the same data you used in the paper?
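For reference, a count like this can be reproduced with a minimal sketch along the following lines. It assumes the file has exactly one header line, one record per line, and UTF-8 encoding; the function name `count_data_lines` is hypothetical and only for illustration.

```python
def count_data_lines(lines, has_header=True):
    """Count non-empty lines from an iterable of strings, optionally skipping one header line."""
    it = iter(lines)
    if has_header:
        next(it, None)  # skip the header line, if there is one
    return sum(1 for line in it if line.strip())


if __name__ == "__main__":
    # Assumption: the TSV was downloaded to the current directory.
    with open("CHOLAN-EL-TREX.tsv", encoding="utf-8") as f:
        n_lines = count_data_lines(f)
    paper_count = 983_257  # sentence count quoted from the paper
    print(n_lines, n_lines - paper_count)
```

With the numbers above, the gap between the file and the paper is 1,089,661 − 983,257 = 106,404 lines.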
Thank you.