chiphuyen / stanford-tensorflow-tutorials

This repository contains code examples for Stanford's course TensorFlow for Deep Learning Research.
http://cs20.stanford.edu
MIT License

Loss not improving after 2.2 #149

Open atinesh-s opened 4 years ago

atinesh-s commented 4 years ago

I have trained the model for quite a long time (up to iteration 1000000), but the loss seems to be stuck around 2.2 and the bot's responses are not satisfactory. Sample conversation and training log:

HUMAN ++++ hello
BOT ++++ ?
HUMAN ++++ who are you
BOT ++++ . . .
HUMAN ++++ whats your name
BOT ++++ .
HUMAN ++++ where are you from
BOT ++++ ' s gone .
HUMAN ++++ what is the time there
BOT ++++ ' s not going to be a bad thing .
HUMAN ++++ I do not understand anything
BOT ++++ !
2019-10-09 14:01:09,550 -- INFO -- Iter 981000: loss 2.2273828502893447, time 0.08978772163391113
2019-10-09 14:03:29,982 -- INFO -- Iter 982000: loss 2.251806115269661, time 0.1407618522644043
2019-10-09 14:05:51,542 -- INFO -- Iter 983000: loss 2.2446057442426683, time 0.2159106731414795
2019-10-09 14:08:11,021 -- INFO -- Iter 984000: loss 2.2406707513332367, time 0.09187722206115723
2019-10-09 14:10:33,569 -- INFO -- Iter 985000: loss 2.243191688776016, time 0.284625768661499
2019-10-09 14:12:52,848 -- INFO -- Iter 986000: loss 2.229316849589348, time 0.09166932106018066
2019-10-09 14:15:12,966 -- INFO -- Iter 987000: loss 2.238621209859848, time 0.0910036563873291
2019-10-09 14:17:32,610 -- INFO -- Iter 988000: loss 2.2455421295166014, time 0.09028363227844238
2019-10-09 14:19:50,839 -- INFO -- Iter 989000: loss 2.2432832870483397, time 0.34220433235168457
2019-10-09 14:22:06,687 -- INFO -- Iter 990000: loss 2.2466694812774657, time 0.09603047370910645
2019-10-09 14:22:15,398 -- INFO -- Test bucket 0: loss 3.1786046028137207, time 0.05455660820007324
2019-10-09 14:22:15,458 -- INFO -- Test bucket 1: loss 3.5405633449554443, time 0.05947542190551758
2019-10-09 14:22:15,532 -- INFO -- Test bucket 2: loss 3.5922553539276123, time 0.07441473007202148
2019-10-09 14:22:15,633 -- INFO -- Test bucket 3: loss 3.529684543609619, time 0.10061907768249512
2019-10-09 14:22:15,755 -- INFO -- Test bucket 4: loss 3.7335658073425293, time 0.12195014953613281
2019-10-09 14:22:15,903 -- INFO -- Test bucket 5: loss 3.8613126277923584, time 0.14754486083984375
2019-10-09 14:24:21,939 -- INFO -- Iter 991000: loss 2.2421313049793246, time 0.08835697174072266
2019-10-09 14:26:39,390 -- INFO -- Iter 992000: loss 2.2535737413167953, time 0.16527605056762695
2019-10-09 14:29:01,609 -- INFO -- Iter 993000: loss 2.263053869485855, time 0.1388380527496338
2019-10-09 14:31:20,481 -- INFO -- Iter 994000: loss 2.2561977257728576, time 0.08809399604797363
2019-10-09 14:33:43,944 -- INFO -- Iter 995000: loss 2.2676013087034224, time 0.09026694297790527
2019-10-09 14:36:02,296 -- INFO -- Iter 996000: loss 2.2528336789608003, time 0.09229564666748047
2019-10-09 14:38:20,023 -- INFO -- Iter 997000: loss 2.2519494262933732, time 0.08894729614257812
2019-10-09 14:40:37,962 -- INFO -- Iter 998000: loss 2.266415533065796, time 0.08812499046325684
2019-10-09 14:42:58,623 -- INFO -- Iter 999000: loss 2.2743323941230775, time 0.09042882919311523
2019-10-09 14:45:17,886 -- INFO -- Iter 1000000: loss 2.2651904397010805, time 0.08829283714294434
2019-10-09 14:45:26,655 -- INFO -- Test bucket 0: loss 3.332874059677124, time 0.05175375938415527
2019-10-09 14:45:26,718 -- INFO -- Test bucket 1: loss 3.459939956665039, time 0.06204557418823242
2019-10-09 14:45:26,791 -- INFO -- Test bucket 2: loss 3.472844123840332, time 0.07347464561462402
2019-10-09 14:45:26,889 -- INFO -- Test bucket 3: loss 3.750105857849121, time 0.09812140464782715
2019-10-09 14:45:27,012 -- INFO -- Test bucket 4: loss 3.4856009483337402, time 0.1224663257598877
2019-10-09 14:45:27,151 -- INFO -- Test bucket 5: loss 3.558220863342285, time 0.13911890983581543
2019-10-09 14:47:39,301 -- INFO -- Iter 1001000: loss 2.268606811881065, time 0.08983755111694336
2019-10-09 14:50:02,334 -- INFO -- Iter 1002000: loss 2.275747295618057, time 0.09300374984741211
2019-10-09 14:52:24,905 -- INFO -- Iter 1003000: loss 2.265431757092476, time 0.2277059555053711
2019-10-09 14:54:45,345 -- INFO -- Iter 1004000: loss 2.2564414784908293, time 0.1413567066192627
2019-10-09 14:57:04,630 -- INFO -- Iter 1005000: loss 2.2581456750631332, time 0.0926673412322998
2019-10-09 14:59:28,269 -- INFO -- Iter 1006000: loss 2.268891993880272, time 0.21859478950500488
2019-10-09 15:01:54,166 -- INFO -- Iter 1007000: loss 2.269172732114792, time 0.09133005142211914
2019-10-09 15:04:14,596 -- INFO -- Iter 1008000: loss 2.2641460099220274, time 0.3426644802093506
2019-10-09 15:06:40,198 -- INFO -- Iter 1009000: loss 2.2892661439180375, time 0.2196190357208252
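
To put a number on the plateau, the log can be summarized with a small script. This is only a sketch that assumes the log format shown above; `LINE_RE` and `summarize_losses` are hypothetical names and not part of the repository. It averages the training ("Iter") and test ("Test bucket") losses separately so the gap between them is easier to see.

```python
import re

# Matches the log lines shown above, e.g.
# "... -- INFO -- Iter 1000000: loss 2.2651904397010805, time 0.088..."
# "... -- INFO -- Test bucket 0: loss 3.332874059677124, time 0.051..."
# (hypothetical helper, assumes this exact log format)
LINE_RE = re.compile(r"-- INFO -- (Iter|Test bucket) (\d+): loss ([0-9.]+),")


def summarize_losses(log_text):
    """Return (mean training loss, mean test loss) for the given log text."""
    train, test = [], []
    for kind, _index, loss in LINE_RE.findall(log_text):
        if kind == "Iter":
            train.append(float(loss))
        else:
            test.append(float(loss))

    def mean(values):
        # NaN when no matching lines were found
        return sum(values) / len(values) if values else float("nan")

    return mean(train), mean(test)


# Three lines copied from the log above
sample = """\
2019-10-09 14:45:17,886 -- INFO -- Iter 1000000: loss 2.2651904397010805, time 0.08829283714294434
2019-10-09 14:45:26,655 -- INFO -- Test bucket 0: loss 3.332874059677124, time 0.05175375938415527
2019-10-09 14:45:26,718 -- INFO -- Test bucket 1: loss 3.459939956665039, time 0.06204557418823242
"""

train_mean, test_mean = summarize_losses(sample)
print(f"train {train_mean:.3f}, test {test_mean:.3f}")
```

Run over the full log, this gives a training loss that stays near 2.2 to 2.3 while the test buckets sit around 3.2 to 3.9, which is the gap described in this issue.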