-
The README lists [Calibrating the Adaptive Learning Rate to Improve Convergence of ADAM](https://arxiv.org/abs/1908.00700v2) by Tong, Liang, and Bi (2019) as the source paper accompanying the `Ranger`…
-
Hi, I'm not an expert but I have a few questions:
While ZORB is impressive performance-wise, how much can the accuracy gap vs. Adam vary? Extensive testing is needed.
Can we port transformers suc…
-
https://github.com/Yonghongwei/Gradient-Centralization/commit/d46e4c54ae47b730d0805694849f106c41828e97
-
Hi~
https://github.com/juntang-zhuang/Adabelief-Optimizer/issues/18#issue-729329117
Since I asked you a question last time, I've done a series of experiments. I think both methods of determining the …
-
Hello,
Thank you for creating a nice project for nnue training in pytorch!
I am trying to use your project to create a network for Igel. I wanted to ask you if it is possible to have a trainer in …
-
Late to the party, but once again good work to you all @LiyuanLucasLiu !
So I was testing RAdam vs AdamW on simple linear models [i.e. Logistic Regression / Linear Regression]. Obviously for these sm…
-
https://github.com/lessw2020/Ranger-Deep-Learning-Optimizer
-
Hello,
Thanks for a very interesting project and for contributing to NNUE training.
I am trying to use the trainer for Igel and when running the test command:
```
python train.py total_3m_d14.bin …
-
Hi @saberkun, @zihangdai, @graykode, @bzantium
The original [zihangdai/XLNet](https://github.com/zihangdai/xlnet) repository hasn't received any updates recently. Should we assume that the XLNet impleme…