brucefan1983 / GPUMD

Graphics Processing Units Molecular Dynamics
https://gpumd.org/dev
GNU General Public License v3.0
417 stars 110 forks source link

better hyperparameters in snes #617

Closed brucefan1983 closed 2 months ago

brucefan1983 commented 2 months ago

Now I understand why SNES needs so large regularization previously. It was due to the too large initial searching variance (learning rates for the parameters).