Closed cahya-wirawan closed 4 years ago
Thank you! Fyi. I've rewritten the repository to clean up the dead code, use fastai sentencepiece and to make it more consistent in the way we handle hyper-parameters. The work is on multifit branch. I'm going to change to release this as 1.0 soon, so what is in the master will be archived.
Fastai changed the argument name of load_data() function from 'fname' to 'file'. This PR will fix the bug in pretrain_lm.py which still use fname in load_data() function. It add also a question about minimal tokens/article to prepare_wiki.sh with the default value of 100.