Fix weight handling for tuple case

foundation-model-stack / fms-fsdp

🚀 Efficiently (pre)training foundation models with native PyTorch features, including FSDP for training and SDPA implementation of Flash attention v2.

https://pytorch.org/docs/stable/fsdp.html

Apache License 2.0


Fix weight handling for tuple case #36

Closed: daviswer closed this 4 months ago

daviswer commented 4 months ago

The recent fix for weight handling introduced a bug: when a weight came in as a tuple, the code returned list(data) rather than list(weight). This PR corrects that, so a tuple weight is now converted to a list of the weights themselves.
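
For readers without the diff at hand, here is a minimal sketch of the kind of mistake described above. It is not the repository's actual code: the function names, the argument names, and the pass-through behavior for non-tuple weights are all assumptions made for illustration.

```python
# Hypothetical illustration only; names and structure are assumed,
# not taken from the fms-fsdp source.

def weights_as_list_buggy(data, weights):
    """Buggy variant: the tuple branch converts the wrong variable."""
    if isinstance(weights, tuple):
        return list(data)  # bug: returns the data entries, not the weights
    return weights


def weights_as_list_fixed(data, weights):
    """Fixed variant: the tuple branch converts the weights themselves."""
    if isinstance(weights, tuple):
        return list(weights)
    return weights


datasets = ("dataset_a", "dataset_b")
weights = (0.7, 0.3)

print(weights_as_list_buggy(datasets, weights))  # ['dataset_a', 'dataset_b']
print(weights_as_list_fixed(datasets, weights))  # [0.7, 0.3]
```

In this sketch the buggy variant still returns a list of the right length, so the error would only surface later, when the returned values are used as numeric weights.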