andyljones / boardlaw

Scaling scaling laws with board games.
https://andyljones.com/boardlaw
MIT License
38 stars 7 forks source link

Experiment Runner #12

Closed andyljones closed 3 years ago

andyljones commented 3 years ago

I'm coming to the point where I want to do lots of training runs in parallel. I've got a lil bit of vast.ai orchestration code already that I'm pretty happy with, but it's a long way short of 'easy to use'.

I could go find a big ol map-reduce framework, but it seems a bit much for my pretty trivial use case of 'separate job on every GPU, pull the output folders back to the master'. So! Roll-my-own time:

Principles

Submission

Dispatch

Monitoring

andyljones commented 3 years ago

Built as jittens. Still need to write up docs, but that can wait until it's been used in anger a few times.