[x] pytorch takes too long to install - wont fix just use modal instead @msaroufim
[x] Killing long running jobs so they don't bork up the queue
[x] Setup modal scheduler @msaroufim - main gap right now is passing in configs to remotee machine
Testing infra @S1ro1
Right now this is omega jank
[x] We have a staging env but its slow to test - improve the local development experience @b9r5
[x] Merging PRs feels very yolo still, I'm never sure something works until I test it so some CI sanity tests would be nice @b9r5
[x] Maybe we don't "test" but we fix fast because we work in different timezones
[x] Modularize code with Discord cogs so its easier to maintain and isolate breakages @S1ro1
What do people upload
[x] numpy script
[x] torch script
[x] triton script @alexzhang13
[ ] cuda script - this is the trickiest but we need to on our end have most of the boilerplate including things like launch params. For advanced users we can get launch params from slash @alexzhang13
[x] Basic cuda support @msaroufim - still jank and it breaks with things like \n
Discord based leaderboard
UX
Leaderboard infra
GPU infra
Startup times
The faster the startup time the more interactive the bot becomes and more popular
Testing infra @S1ro1
Right now this is omega jank
What do people upload
Profiling/Ranking