dhlee347 / pytorchic-bert

Pytorch Implementation of Google BERT
Apache License 2.0
589 stars, 181 forks

Can you please provide books_large_all.txt? #17

AyanKumarBhunia closed 4 years ago

dhlee347 commented 4 years ago

It is the "Toronto Book Corpus", but its authors no longer provide it. That said, I think you can train the model with any text data.
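
For anyone substituting their own text, here is a minimal sketch of turning raw documents into a single corpus file. It assumes the layout used by Google's original BERT pretraining data (one sentence per line, a blank line between documents); whether this repository's loader expects exactly that is an assumption to check against its pretraining script. The helper names `split_sentences` and `build_corpus` are hypothetical, and the sentence splitter is deliberately naive.

```python
import re
from pathlib import Path

# Naive sentence boundary: whitespace that follows '.', '!' or '?'.
# A real tokenizer (e.g. from NLTK or spaCy) would handle abbreviations better.
_SENT_END = re.compile(r"(?<=[.!?])\s+")


def split_sentences(text):
    """Split raw text into sentences using a simple punctuation rule."""
    text = " ".join(text.split())  # collapse newlines and repeated spaces
    return [s for s in _SENT_END.split(text) if s]


def build_corpus(documents, out_path):
    """Write one sentence per line, with a blank line between documents.

    ASSUMPTION: this mirrors the original BERT data convention; verify it
    against the data loader you actually train with.
    """
    wrote_any = False
    with Path(out_path).open("w", encoding="utf-8") as f:
        for doc in documents:
            sentences = split_sentences(doc)
            if not sentences:
                continue  # skip empty documents entirely
            if wrote_any:
                f.write("\n")  # blank line marks a document boundary
            for sentence in sentences:
                f.write(sentence + "\n")
            wrote_any = True


if __name__ == "__main__":
    docs = [
        "This is the first document. It has two sentences.",
        "A second document follows!",
    ]
    build_corpus(docs, "my_corpus.txt")
```

The output path `my_corpus.txt` is a placeholder; point the training configuration at whatever file you generate.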