Open jettjaniak opened 4 months ago
This is not an issue for our stories-* suite, since the stories dataset is already shuffled. It could be an issue for other datasets, though, because of the concatenation that happens during tokenization.
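To illustrate why concatenation matters, here is a minimal sketch. It assumes tokenization packs documents into fixed-length sequences by joining them with a separator token and slicing the resulting stream. `pack_documents`, `EOS`, and the token ids are hypothetical and not taken from our codebase.

```python
from typing import List

EOS = 0  # hypothetical separator token id, for illustration only


def pack_documents(docs: List[List[int]], seq_len: int) -> List[List[int]]:
    """Concatenate tokenized docs (EOS after each) and cut into seq_len chunks.

    The trailing partial chunk, if any, is dropped.
    """
    stream: List[int] = []
    for doc in docs:
        stream.extend(doc)
        stream.append(EOS)
    return [
        stream[i : i + seq_len]
        for i in range(0, len(stream) - seq_len + 1, seq_len)
    ]


# Three already-tokenized documents, kept in their original dataset order.
docs = [[1, 2, 3], [4, 5], [6, 7, 8, 9]]
sequences = pack_documents(docs, seq_len=4)
# The middle sequence contains the end of the second doc and the start of the third.
```

Under this assumption, a single sequence can hold tokens from neighbouring documents. If the source dataset is ordered (for example by topic or source), those neighbours are correlated, and shuffling the packed sequences afterwards does not undo that.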