TZeng20 closed this issue 1 month ago
@TZeng20 Yes, inference with FSDP is not recommended because of the all-gather call before each forward pass. We have mostly explored TGI and vLLM, as suggested here.
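To give a feel for why that all-gather hurts during generation, here is a rough back-of-the-envelope sketch. It assumes full sharding, that gathered parameters are freed again after every forward pass, and that generation runs one forward pass per new token. The function name and the numbers are made up for illustration, not measurements.

```python
def allgather_bytes_per_rank(num_params, bytes_per_param, world_size, num_new_tokens):
    """Rough estimate of the bytes each rank receives from parameter
    all-gathers while generating `num_new_tokens` tokens.

    Assumes every forward pass re-gathers the full model, so each rank
    receives the (world_size - 1) / world_size share it does not own.
    """
    per_forward = num_params * bytes_per_param * (world_size - 1) / world_size
    return per_forward * num_new_tokens


# Hypothetical setup: 7B parameters in bf16 (2 bytes each), 8 ranks, 256 new tokens.
estimate = allgather_bytes_per_rank(7_000_000_000, 2, 8, 256)
print(f"{estimate / 1e12:.2f} TB received per rank")
```

Under these assumptions the traffic grows linearly with the number of generated tokens, which is the main reason a dedicated inference server is usually a better fit.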
Hi, I'm also facing this issue. My models are saved like "__0_0.distcp", "__1_0.distcp"... and so on. How can I load these models so that I can run model.generate?
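One possible route, as a sketch only: files named like `__0_0.distcp` look like shards written by `torch.distributed.checkpoint` (rank, then file index), and PyTorch 2.2 or newer ships `dcp_to_torch_save` to merge them into a single `torch.save` file. The helper names `list_dcp_shards` and `consolidate_dcp_checkpoint` below are hypothetical, and the sketch assumes the checkpoint directory also contains the `.metadata` file written at save time.

```python
import re
from pathlib import Path

# Shard files are assumed to be named __<rank>_<file index>.distcp
_SHARD_RE = re.compile(r"^__(\d+)_(\d+)\.distcp$")


def list_dcp_shards(checkpoint_dir):
    """Return sorted (rank, file_index, filename) tuples for each shard file."""
    shards = []
    for path in Path(checkpoint_dir).iterdir():
        match = _SHARD_RE.match(path.name)
        if match:
            shards.append((int(match.group(1)), int(match.group(2)), path.name))
    return sorted(shards)


def consolidate_dcp_checkpoint(checkpoint_dir, output_path):
    """Merge the shards into one torch.save file (needs PyTorch >= 2.2)."""
    if not list_dcp_shards(checkpoint_dir):
        raise FileNotFoundError(f"no .distcp shards found in {checkpoint_dir}")
    # Imported lazily so the shard listing above works without PyTorch installed.
    from torch.distributed.checkpoint.format_utils import dcp_to_torch_save

    dcp_to_torch_save(checkpoint_dir, output_path)
    return output_path
```

After that, `torch.load(output_path)` should return a plain dictionary. How the model weights are nested inside it depends on how the checkpoint was saved, so inspect the keys before passing the weights to `load_state_dict` on an unwrapped (non-FSDP) model and calling model.generate.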
Hi,
In the inference scripts, I see that there is no option to perform inference with FSDP.
Is model.generate not recommended when the model is wrapped in FSDP? Or DDP? Thanks