bigscience-workshop / petals

🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading
https://petals.dev
MIT License
8.89k stars, 489 forks

Remove smaller limit for legacy bfloat16 serialization #505

Open: borzunov opened 10 months ago

borzunov commented 10 months ago

Revert #251 since it's not needed after #311. This may improve fine-tuning efficiency for medium-sized batches.

TODO: