Closed MustafaCelen closed 1 year ago
Thanks for reporting this @MustafaCelen
This is definitely unexpected behaviour. I'd expect each trial to take a similar amount of time (like the first two), with perhaps a small increase because each trial has to append its results to the returned object. On our end, we can check whether we are reprocessing information unnecessarily after each trial, which would let us reduce timings a bit, but I'm not very confident this is happening. @Amyhaoming can you please double check for us?
The large increase in the 4th trial makes me believe there's something else running on that machine. @Amyhaoming can you try and replicate with 10 trials using the demo data please? Do we experience the same behaviour?
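To make the comparison across trials concrete, here is a minimal Python sketch that flags trials whose duration is well above the median. It is not part of Robyn; `flag_slow_trials`, the `factor` threshold, and the durations are all hypothetical and only for illustration.

```python
from statistics import median


def flag_slow_trials(durations_min, factor=2.0):
    """Return 1-based indices of trials slower than `factor` times the median.

    `durations_min` is a list of per-trial durations in minutes.
    `factor` is an arbitrary threshold chosen for this sketch.
    """
    baseline = median(durations_min)
    return [
        i
        for i, duration in enumerate(durations_min, start=1)
        if duration > factor * baseline
    ]


# Made-up durations (minutes), not taken from the report above.
example_durations = [30.0, 31.5, 29.8, 95.0, 102.3]
print(flag_slow_trials(example_durations))
```

If only the later trials are flagged on a machine with nothing else running, that points at the run itself rather than at competing load. Note that the median stops being a useful baseline once most trials are slow.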
@MustafaCelen when you analyze your results, don't your models converge well before reaching the 15000 iterations per trial? Have you checked? Also, are the 10 trials really that different from each other? I'd expect 5 trials to be enough, given that they all start from a different random state and try to converge to similar results, but I can't say for sure with your data. And yes, running Weibull takes a bit longer because it has one more hyperparameter than Geometric, which gives more flexibility but also makes it slower.
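One rough way to check whether a trial stopped improving early is to compare the best error before and after the last block of iterations. The sketch below is a generic Python illustration, not Robyn's own convergence criterion; `has_plateaued`, `window`, and `tol` are hypothetical names, and it assumes you have exported a per-iteration error series such as NRMSE.

```python
def has_plateaued(errors, window=1000, tol=0.01):
    """Return True if the best error barely improved over the last `window` iterations.

    `errors` is a per-iteration error series (lower is better).
    Improvement is measured relative to the best error seen before the window.
    """
    if len(errors) <= window:
        # Not enough history to compare against.
        return False
    best_before = min(errors[:-window])
    best_overall = min(errors)
    if best_before == 0:
        # Already at zero error, so nothing is left to improve.
        return True
    relative_gain = (best_before - best_overall) / abs(best_before)
    return relative_gain < tol


# Made-up error series, for illustration only.
flat_tail = [1.0, 0.5, 0.4, 0.4, 0.4]
still_improving = [1.0, 0.9, 0.5, 0.2]
print(has_plateaued(flat_tail, window=2))
print(has_plateaued(still_improving, window=2))
```

If a check like this reports a plateau long before the final iteration, the extra iterations are mostly adding run time.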
Thank you for your reply @laresbernardo.
This was the first time I tried 15000 iterations. I usually go with 10000, but the same problem appears. The VM is dedicated to me, but I will double check with the system admins. As for the trials, my current best model was in the 7th trial, so I go with 10 just to make sure. NRMSE usually converges, but DECOMP.RSSD does not. The run with 15k iterations was the first time DECOMP.RSSD converged, but the model did not improve much.
Please check this answer: https://www.facebook.com/groups/robynmmm/posts/1358750738226389 In short, DECOMP.RSSD doesn't necessarily have to converge.
Please reopen if necessary.
Hi, as you can see below, the duration increased a lot after the 4th trial. I have been facing this issue often. I use the recommended Weibull PDF hyperparameter ranges; for only one channel, I gave the maximum degrees of freedom for alpha, gamma, shape and scale. Is such an increase normal?