axolotl-ai-cloud / axolotl

Go ahead and axolotl questions
https://axolotl-ai-cloud.github.io/axolotl/
Apache License 2.0
7.58k stars 822 forks source link

transformers memory tool #848

Open winglian opened 10 months ago

winglian commented 10 months ago

āš ļø Please check that this feature request hasn't been suggested before.

šŸ”– Feature description

https://tinkerd.net/blog/machine-learning/distributed-training/#memory-requirements-of-transformers

Screenshot 2023-11-13 at 8 54 57 PM

we should add this as a cli tool that calculates a memory estimate from the YML file.

āœ”ļø Solution

see above

ā“ Alternatives

No response

šŸ“ Additional Context

No response

Acknowledgements

KCaverly commented 8 months ago

This would be pretty useful for me, Iā€™d be happy to give this a go this week.

NanoCode012 commented 5 months ago

Just fyi, HF has a space for this: https://huggingface.co/spaces/hf-accelerate/model-memory-usage