Closed DahuoJ closed 5 years ago
Hi @DahuoJ , thanks for your interest in the repo. You may increase the number of training epochs between evaluations given by --epochs_per_eval, whose default value is 1.
Thanks, this has an effect. Does this mean that --train_epochs also needs to be adjusted?
No. --train_epochs
indicates the number of iteration you train the training dataset. Whereas the --epochs_per_eval
means how often you evaluate dateset during training. Though, of course, if the number of training is not sufficient, you need to increase --train_epochs
.
Thanks for your advice, I have started normal training and it takes about 14 hours to get results. 👍 👍
Hello!! @rishizek ,I can run it successfully! but I have some doubts.
![z vz ef jq45l 43 v5q w](https://user-images.githubusercontent.com/44385545/51594793-39324d80-1f30-11e9-9c8c-e58bf1f3a43c.png)
It's too slow, and it's not reasonable to save weights every 4 times of training. I'm not sure where I need to make code changes. I wish it cannot save checkpoint for 4 into ./model/model.ckpt and start evaluation. What should I do to make it display only the following two messages?![w 4 g35m q cfhp ks8 3v](https://user-images.githubusercontent.com/44385545/51594607-c6c16d80-1f2f-11e9-8f15-8275543b5e32.png)
I guess it may be because my own data set is too small? I am very confused, How can I make it smooth? Any help will be appreciated!!