n-waves / multifit

The code to reproduce results from paper "MultiFiT: Efficient Multi-lingual Language Model Fine-tuning" https://arxiv.org/abs/1909.04761
MIT License
282 stars 56 forks source link

error in LM pretraining #62

Open blazejdolicki opened 4 years ago

blazejdolicki commented 4 years ago

What I did?

From initial debugging, train.items is an ndarray with shape () . When I print it, it returns articles in German. I suppose this part suggests a problem Wiki text was split to 1 articles - I reckon the wiki text should be split in more than 1 article. So maybe something goes wrong in read_wiki_articles() in dataset.py... This is my educated guess, but I don't know where to go from here.

blazejdolicki commented 4 years ago

My package versions differ slightly from those in requirements.txt, maybe sacremoses is related: fire 0.3.0 sacremoses 0.0.38 sentencepiece 0.1.85 fastai 1.0.47