LeelaChessZero / lc0

The rewritten engine, originally for tensorflow. Now all other backends have been ported here.
GNU General Public License v3.0
2.38k stars 525 forks source link

Fix layernorm epsilon for smolgen weights. #1914

Closed almaudoh closed 1 year ago

almaudoh commented 1 year ago

The value of epsilon in the layer norms for smolgen weights are 1e-3, but were coded as 1e-6 in some backends. This may not have a huge impact on Elo, though, but may be worth testing.