guoqincode / Open-AnimateAnyone

Unofficial Implementation of Animate Anyone
2.85k stars 230 forks source link

Getting DDP Inconsistent Params error on train #88

Open CaganKiper opened 6 months ago

CaganKiper commented 6 months ago

I'm getting RuntimeError: DDP expects same model across all ranks, but Rank 0 has 686 params, while rank 1 has inconsistent 0 params. while trying to train the model. I'm using 8 A100 GPUs as recommended with and batch-size 64 in stage one. I've not been able to start the train_hack code past unet = DDP(unet, device_ids=[local_rank], output_device=local_rank) line.