ridgerchu / matmulfreellm

Implementation for MatMul-free LM.
Apache License 2.0
2.5k stars 139 forks source link

scaling_law.png #24

Open clort81 opened 1 week ago

clort81 commented 1 week ago

Are the labels on the solid blue and yellow lines swapped?

The dashed lines appear to show a lower loss for MatMul-Free LM, while the solid lines show a higher loss.

ridgerchu commented 5 days ago

Hi,

actually the solid line is the scaling law, which is a fitted line with the real loss, and dash line is the real loss. it seems correct.