Thanks for providing animate_anything_svd_v1.0 model.
May i ask how many videos are used to finetune the svd model to acquire animate_anything_svd_v1.0 ?
and maybe the different between svd and svd_mask model is the conv_in module? for the input dimension from 8 to 9.
Thanks for providing animate_anything_svd_v1.0 model.
May i ask how many videos are used to finetune the svd model to acquire animate_anything_svd_v1.0 ? and maybe the different between svd and svd_mask model is the conv_in module? for the input dimension from 8 to 9.