Closed zysNLP closed 5 years ago
Hi, the data can be created using bert-as-service together with the notebooks TOEFL_dataParse.ipynb or ASAP_dataParse.ipynb, followed by BERT_text_representation.ipynb. The TOEFL data can be obtained from LDC, and the ASAP data can be obtained from Kaggle. Let me know if you have questions.
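For readers who want a rough idea of what that step looks like, here is a minimal sketch, not the code from the notebooks. It assumes the features are one fixed-size vector per essay saved as a .npy file, and that a bert-as-service server is already running locally. The helper names `build_features` and `bert_service_encoder` are hypothetical and only for illustration.

```python
import os

import numpy as np


def build_features(essays, encode, out_path):
    """Encode each essay to a vector and save the stacked array as .npy.

    `encode` is any callable mapping a list of strings to a 2-D array-like
    (one row per essay). This helper is illustrative, not from the repo.
    """
    essays = list(essays)
    vectors = np.asarray(encode(essays), dtype=np.float32)
    if vectors.ndim != 2 or vectors.shape[0] != len(essays):
        raise ValueError("encoder must return one vector per essay")
    # Create the target directory if needed, then write the array.
    os.makedirs(os.path.dirname(out_path) or ".", exist_ok=True)
    np.save(out_path, vectors)
    return vectors


def bert_service_encoder():
    """Return the encode method of a bert-as-service client.

    Assumes `bert-serving-start` is already running on the default ports;
    the import is deferred so the module loads without the package.
    """
    from bert_serving.client import BertClient

    return BertClient().encode
```

With a server running, a call such as `build_features(essays, bert_service_encoder(), "data/TOEFL/X_train_TOEFL.npy")` would write the array; the essay texts themselves still have to come from the licensed LDC corpus or from the Kaggle ASAP data.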
OK, thank you very much! But how do I download the LDC data from its website? I couldn't find the link.
The LDC data is available, but requires a license.
Thank you, but I still can't get the data. Should I fill in the license agreement "ets-corpus-of-non-native-written-english.pdf" and then submit it to them?
Here are the details for obtaining data: https://www.ldc.upenn.edu/language-resources/data/obtaining
It seems to cost a lot of money...
Could you please give me some data?
Line 20 of BERT_BCA_train.ipynb refers to data/TOEFL/X_train_TOEFL.npy. Where can I find this file? Could you please share it with me? Thank you very much!