issues
search
foundation-model-stack
/
fms-fsdp
🚀 Efficiently (pre)training foundation models with native PyTorch features, including FSDP for training and SDPA implementation of Flash attention v2.
https://pytorch.org/docs/stable/fsdp.html
Apache License 2.0
162
stars
27
forks
source link
add 1.4b variant config
#40
Closed
lchu-ibm
closed
6 months ago