textsum does not converge

xiuchen commented 7 years ago

I have trained text sum for 5 days with the parameters recommended in the project page. I use a training set with more than 3 million article-summary pairs. At first running_average_loss decrease slowly from around 9 to around 4, but after that, running_average_loss value changes in a wide range, it can be as high as more than 5, but sometimes can be as low as 1. And I test the model with some article in the training set, but the output is far from the referenced summary, I'm confused. Can someone share your experience ? I'm confused with following questions

running_average_loss is less then 10 every time I run , is it normal?
dose it over fitting since running_average_loss varies in a wide range and has no sign to converge ?
how long will it take to train a model good enough or when to stop training ? Is there a sign to indicate to stop training?

poxvoculi commented 7 years ago

This question is better asked on StackOverflow since it is not a bug or feature request. There is also a larger community that reads questions there. Thanks!

anthnyprschka commented 7 years ago

Have you posted your question on SO yet? Would be interested to follow the discussion

tensorflow / models

textsum does not converge #2125