Closed eliebak closed 4 months ago
Added a 1-sqrt function for the cooldown phase. This function can outperform the classical linear decay method. From this paper https://huggingface.co/papers/2405.18392.
Added a 1-sqrt function for the cooldown phase. This function can outperform the classical linear decay method. From this paper https://huggingface.co/papers/2405.18392.