bigcode-project / bigcode-analysis

Repository for analysis and experiments in the BigCode project.
Apache License 2.0
109 stars 20 forks source link

add scaling laws notebook #35

Closed lvwerra closed 1 year ago

lvwerra commented 1 year ago

Notebook to replicate Chinchilla scaling laws. Follows steps in D.2 (however for pass@k and its inverse rather than loss).

Randl commented 1 year ago

The csv file is missing?

lvwerra commented 1 year ago

Indeed, added it here: https://huggingface.co/datasets/bigcode/scaling-laws-exp/tree/main

harm-devries commented 1 year ago

Just pushed a fix for the optimization issues in the scaling laws. Fitted function now looks as follows:

Unknown-12

Randl commented 1 year ago

Probably better make x-axis logscale