daniel-kukiela / nmt-chatbot

NMT Chatbot
GNU General Public License v3.0
385 stars 212 forks source link

Chatbot replies http webpage link? #100

Open Kinwong1103 opened 5 years ago

Kinwong1103 commented 5 years ago

I have trained my model using reddit comments following sentdex's youtube tutorial. However, my chatbot usually replies with answer like "https://reddit/xxx/xxxx" when i ask him simple question like "how are you today?" May I ask that how've i done wrong or how to prevent this issue?

aakashrkumar commented 5 years ago

i have this save ishue, it alewase responds with "i dont know what your talking about" or a reddit/giphy or some other link I am useing tensorflow 1.9 and 2015-1 database

daniel-kukiela commented 5 years ago

That means that you have too few data in your training dataset or you have to train it longer. Chatbot is bound somehow to links. You could also remove from training data all pairs containing links.

aakashrkumar commented 5 years ago

I have around 4 million pairs and I have done a little over 1 epoch, home much more should I train to get a better bot?

daniel-kukiela commented 5 years ago

My test bot is trained on 1 or 2 million pairs iirc for 5 epochs. Give it a try for 1 or 2 more epochs.

tarunkolla commented 5 years ago

Had the same problem. Was training a bot with 1M pairs and all that It would respond is "I dont know" or an incomplete web link. So I choose a different dataset and got like 2.5M pairs and wrote a script to clear all the http/s links in the data. My bot now is doing much better. No links and luckily no "I Dont Know" answers. I did 4 epochs with about 75k steps in total and the bot is decent enough.