Closed ManSchri closed 7 years ago
It is a property of stochastic gradient descent that you will get slightly different answers for each run. However, on this particular data set, results didn't vary more than 1%, and were pretty stable, so we reported only a single pass.
If you're more comfortable doing multiple passes, you can easily modify the code to do that, and average over them...
I have noticed the evaluation results for the half-life regression model are slightly different each time I run the code. So I was wondering how the evaluation results presented in the paper are determined.
Are the results given in table 2 of the paper based on one run or are they based on multiple runs (using the best result or the average or something else)?
Thanks!