gaow0007 / code-reading

MIT License
0 stars 0 forks source link

Online Workload Allocation and Energy Optimization in Large Language Model Inference Systems #190

Open gaow0007 opened 1 week ago

gaow0007 commented 1 week ago

https://github.com/grantwilkins/energy-inference