Closed — johndpope closed 7 months ago
Thanks for your attention.
In fact, we use Stable Video Diffusion (SVD) as the baseline. Its fine-grained structure and powerful pre-training can be treated as a base model + ReferenceNet. We only added a ControlNet to control the pose, acting as a pose guider. In theory, the ControlNet input could be replaced by audio to achieve the EMO effect. However, we do not plan to open-source the training code, but you can refer to what I wrote in the README to train your own EMO.
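The conditioning scheme described above can be sketched in PyTorch. This is a hypothetical minimal pose-guider: a small convolutional encoder whose output would be added as a residual to the base model's latent input, following the ControlNet convention of zero-initializing the final layer so training starts from the unmodified base model. The module name, channel counts, and shapes are illustrative assumptions, not taken from the authors' code.

```python
import torch
import torch.nn as nn

class PoseGuider(nn.Module):
    """Encode a pose image into a residual matching the latent shape.

    Illustrative sketch only: real pose guiders (e.g. in AnimateAnyone-style
    pipelines) use more capacity, but the principle is the same.
    """
    def __init__(self, cond_channels: int = 3, latent_channels: int = 4):
        super().__init__()
        # Three stride-2 convs give the 8x spatial downsampling typical of
        # a VAE latent space (512x512 image -> 64x64 latent).
        self.encoder = nn.Sequential(
            nn.Conv2d(cond_channels, 16, 3, stride=2, padding=1),
            nn.SiLU(),
            nn.Conv2d(16, 32, 3, stride=2, padding=1),
            nn.SiLU(),
            nn.Conv2d(32, latent_channels, 3, stride=2, padding=1),
        )
        # Zero-init the last conv so the residual is zero at the start of
        # training, as in ControlNet: the base model's behavior is preserved.
        nn.init.zeros_(self.encoder[-1].weight)
        nn.init.zeros_(self.encoder[-1].bias)

    def forward(self, pose_image: torch.Tensor) -> torch.Tensor:
        return self.encoder(pose_image)

guider = PoseGuider()
pose = torch.randn(1, 3, 512, 512)
residual = guider(pose)  # shape (1, 4, 64, 64); all zeros at initialization
```

Swapping the pose image for encoded audio features at this conditioning input is, in principle, how the EMO-style effect mentioned above would be reached.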
This is another implementation of the paper with training code (n.b. MooreThreads is a competitor to Nvidia): https://github.com/MooreThreads/Moore-AnimateAnyone/blob/master/train_stage_1.py
I pulled apart their architecture and it's very complex; I like how succinct this code is by comparison.
I want to implement the EMO (Emote Portrait Alive) paper using the CelebV-HQ dataset: https://github.com/johndpope/emote-hack
My progress is slow. I think you have removed a lot of complexity by using diffusers, though it could be extended with a pose guider and other models to help steer the output. Please consider releasing the training code, and/or name a price to open-source it.