daniel-kukiela / nmt-chatbot

NMT Chatbot
GNU General Public License v3.0
387 stars 214 forks source link

about dataset #140

Open youngornever opened 4 years ago

youngornever commented 4 years ago

I want to know where does the new_data come from? Can I use it in my paper? Moreover, I find new_data/tst2012 and new_data/tst2013 have the same content.

kaljitism commented 4 years ago

You need to watch Sentdex's tutorials on extracting and preprocessing the data from files.pushshift.io/reddit/comments