allenai / bilm-tf

TensorFlow implementation of contextualized word representations from bi-directional language models
Apache License 2.0
1.62k stars 452 forks

Why should the vocabulary file be sorted in descending order by token count in our training data? #228

Open showerage opened 4 years ago