Closed Reginald-L closed 1 week ago
Hi, I am using deepspeed zero3 to fine tune flux model using kohya scripts.
flux = accelerator.unwrap_model(flux) print(f"flux - {flux.state_dict()['single_blocks.7.linear1.weight'].shape}") print(f"flux - {flux.state_dict()['single_blocks.7.linear1.weight'].device}")
I got the below result:
and When I save the trained model, I got this:
Here is my zero config:
Could you try the with deepspeed.zero.GatheredParameters(params) context?
with deepspeed.zero.GatheredParameters(params)
Cool, thanks very much, your solution is pretty useful
Hi, I am using deepspeed zero3 to fine tune flux model using kohya scripts.
I got the below result:
and When I save the trained model, I got this:
Here is my zero config: