Farahn / AES

Automatic Essay Scoring
33 stars 12 forks source link

where is data? #1

Closed zysNLP closed 5 years ago

zysNLP commented 5 years ago

As in BERT_BCA_train.ipynb line 20 described, where is data/TOEFL/X_train_TOEFL.npy? Could you please offer to me? thank you very much!

Farahn commented 5 years ago

Hi, the data can be created using Bert as service and the scripts TOEFL_dataParse.ipynb or ASAP_dataParse.ipynb and BERT_text_representation.ipynb. The TOEFL data can be obtained from LDC, and the ASAP data can be obtained from Kaggle. Let me know if you have questions.

zysNLP commented 5 years ago

OK,Thank you very much! But I wonder how to download LDC data from its website? I couldn't find the hyperlink.

Farahn commented 5 years ago

The LDC data is available, but requires a license.

zysNLP commented 5 years ago

Thank you, but I still can't get data. Should I fill in the download license "ets-corpus-of-non-native-written-english.pdf" and then submit to them?

Farahn commented 5 years ago

Here are the details for obtaining data: https://www.ldc.upenn.edu/language-resources/data/obtaining

zysNLP commented 5 years ago

It seems to need a lot of money...

zysNLP commented 5 years ago

Could you please give me some data?