brightmart / text_classification

all kinds of text classification models and more with deep learning
MIT License
7.87k stars 2.57k forks

intermediate data files #126

Open rbaral opened 5 years ago

rbaral commented 5 years ago

First of all, thanks for your effort in making this repo interesting. I ran the preprocessing notebook and was able to get some of the files; however, the other scripts use a lot of data files that are not easily accessible. I tried many times to get a Baidu storage account but couldn't because I have an overseas phone number. Could you share the script that generates the data files used in your scripts?

Sylv-Lej commented 5 years ago

You can download the dataset from the contest website:

https://biendata.com/competition/zhihu/

Here is a Dropbox link; I don't know how long it will keep working:

https://www.dropbox.com/s/3sk2yojptodkmb2/ieee_zhihu_cup.rar?dl=0