Closed sven-nm closed 2 years ago
Thanks a lot, @sven-nm! Would it be possible to add a test for tsv_to_torch_datasets
?
And also bump the version to 0.3.0
.
@mromanello, I think its all good now, I let you have a final check before merging.
feature missing : #3 does not allow for cutting and recycling samples with a length superior to the model's max_length
. To be fixed before merge.
Also : untokenize the dataset. tokenization should be done afterwards.
@mromanello @simon-clematide added just a few functionalities there and corrected the previous comment, now ready for a merge ;-)