google-research / albert

ALBERT: A Lite BERT for Self-supervised Learning of Language Representations
Apache License 2.0
3.23k stars 570 forks source link

Fix running experiments on RACE (no all.txt) #185

Closed twilightdema closed 4 years ago

twilightdema commented 4 years ago

ROOT CAUSE: Running experiments on RACE dataset downloaded from: https://www.cs.cmu.edu/~glai1/data/race/ produces error because all.txt is not existed. Apparently, the RACE dataset contained many .txt files, each file is a 1 JSON object in single line.

FIX: To read them correctly, we should iterate over all .txt file in the directory and parse them.

googlebot commented 4 years ago

We found a Contributor License Agreement for you (the sender of this pull request), but were unable to find agreements for all the commit author(s) or Co-authors. If you authored these, maybe you used a different email address in the git commits than was used to sign the CLA (login here to double check)? If these were authored by someone else, then they will need to sign a CLA as well, and confirm that they're okay with these being contributed to Google. In order to pass this check, please resolve this problem and then comment @googlebot I fixed it.. If the bot doesn't comment, it means it doesn't think anything has changed.

ℹ️ Googlers: Go here for more info.