Closed dungxibo123 closed 2 years ago
Hi Tien, your assumption is more or less correct. We used either random or Bayesian hyperparameter tuning to get these numbers. Initially we did this with Ray Tune (there's a ray_tune.py module for this), but these days we use Weights and Biases. Hope this helps
Dear authors,
I have read your works on GRAND and BLEND papers and have checked the implementation for those papers.
I have a following question. In
src/best_params.py
I saw thatI notice that there are several parameters was used here came from some unknown process, such as:
'decay': 0.00507685443154266
'dropout': 0.046878964627763316
'tol_scale': 821.9773048827274
So how could you get those above hyperparameter. I have asked my senior, he said that maybe it come from some Bayesian Optimization based on Gaussian Process. But we are not sure about our idea.
So can you explain for me, which methods you have used and how can it be implement.
Thanks in advance,
Best regards, Tien Dung