google-research / albert

ALBERT: A Lite BERT for Self-supervised Learning of Language Representations
Apache License 2.0
3.23k stars 571 forks source link

Where to download "all.txt" to train the sentencepiece? #238

Open YuHengKit opened 3 years ago

YuHengKit commented 3 years ago

Hi, thanks for the effort made. I wish to know Is it using RACE dataset? Any sample code to compile them into "all.txt"?