Hi, I am pretty new here. I would like to know how to provide my own dataset instead of using the one that comes with the library. I have a parallel corpus stored in two text files, source.txt and target.txt. All the annotated transformer model examples I have seen use the dataset that comes with the library rather than a custom dataset. Could you please point me to a link or tutorial showing how to prepare my dataset so that I can feed it into this model? Many thanks.