Closed HiTrong closed 3 months ago
When i checked again the dataset from hugging face (link), I realized that the dataset had a lot of errors. For example: chinese, special symbol, wrong translation,... Specially, the distribution of the lengths is not the same. The length is skewed towards the short side. Next, I checked agian the architecture. My Positional Encoding Function has not been set 'requiresgrad(False)' since my building model. Finally, I decided to remake my building model and chose other dataset which was collected completely correctly.