Open YutoNishimura-v2 opened 10 months ago
https://github.com/pytorch/pytorch/blob/v2.0.0/torch/distributed/fsdp/_runtime_utils.py#L334
I found that in torch v2.0.0, _handles is used. Therefore, this is the version problem, I think.
Can confirm your patch works for me too @YutoNishimura-v2.
Hello,
I am attempting to train the medium model of musicgen using FSDP. I am simply using the following command:
dora run -d [other options see training docs] fsdp.use=true autocast=false
However, I encountered the following error:
Upon inspecting the PyTorch implementation, I don't believe _handles is a list. The problem was resolved after I changed the code to the following (I'm not sure if this is the expected behavior):
Could this be a bug due to different PyTorch versions? I am using version 2.1.0+cu121. My Python version is 3.10.10.
Looking forward to your response. Thank you.