stanford-crfm / ecosystem-graphs


Convert all `training_time` fields to use petaflop/s-day #31

Closed dilarasoylu closed 2 years ago

dilarasoylu commented 2 years ago

Purpose

This PR standardizes the `training_time` field so that it is always reported in petaflop/s-days. Reporting only the number of hours or days trained makes models hard to compare, because the same wall-clock time on different hardware corresponds to different amounts of compute. We use the methods outlined in the [OpenAI AI and Compute] post to convert training times reported in hours, days, or petaflops into petaflop/s-days. The petaflop/s-day is also a practical choice because it is commonly used to report training compute for newly released models.
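For reviewers, here is a minimal sketch of the kind of conversion described above. It is not the code used in this PR. The function names and parameters are hypothetical. The default utilization of 0.33 follows the GPU assumption in the AI and Compute post, and the per-device peak throughput must be supplied by whoever fills in the asset.

```python
SECONDS_PER_DAY = 86_400
PETA = 1e15
# One petaflop/s-day is 1e15 operations per second sustained for one day.
OPS_PER_PFS_DAY = PETA * SECONDS_PER_DAY  # 8.64e19 operations


def hardware_time_to_pfs_days(num_devices, days, peak_flops_per_device,
                              utilization=0.33):
    """Estimate petaflop/s-days from device count and training duration.

    peak_flops_per_device is the device's peak throughput in FLOP/s.
    utilization is an assumed fraction of peak actually achieved
    (0.33 is an assumption, not a measured value).
    """
    if not 0 < utilization <= 1:
        raise ValueError("utilization must be in (0, 1]")
    total_ops = (num_devices * days * SECONDS_PER_DAY
                 * peak_flops_per_device * utilization)
    return total_ops / OPS_PER_PFS_DAY


def total_flops_to_pfs_days(total_flops):
    """Convert a reported total operation count into petaflop/s-days."""
    return total_flops / OPS_PER_PFS_DAY
```

For a run reported in hours, divide by 24 before passing the value as `days`.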

Comments

Although it gives a standard view of training time across models, the petaflop/s-day may not be meaningful to the general public. I propose using it as the standard metric in our assets and converting it into other units in the UI as needed. For example, we can convert petaflop/s-days back into hours or days by assuming that every model was trained on the same reference GPU.
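The UI-side conversion proposed here could look roughly like the sketch below. The function name, the reference throughput, and the utilization are all hypothetical placeholders, not values chosen in this PR. The result is an estimate under the reference-GPU assumption, not the training time the model authors actually reported.

```python
def pfs_days_to_wall_clock_days(pfs_days, reference_peak_petaflops,
                                utilization=0.33, num_devices=1):
    """Estimate wall-clock days for a given compute budget.

    reference_peak_petaflops is the assumed peak throughput of the
    reference GPU in petaflop/s. utilization and num_devices are
    assumptions the UI would have to fix or expose to the user.
    """
    if reference_peak_petaflops <= 0 or num_devices <= 0:
        raise ValueError("throughput and device count must be positive")
    if not 0 < utilization <= 1:
        raise ValueError("utilization must be in (0, 1]")
    effective_petaflops = reference_peak_petaflops * utilization * num_devices
    return pfs_days / effective_petaflops


def days_to_hours(days):
    """Convert days to hours for display."""
    return days * 24
```

Because the result depends entirely on the chosen reference GPU and utilization, the UI should probably display those assumptions next to the converted number.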