johndpope / SPEAK-hack

Using Claude Sonnet to reverse engineer paper Listen, Disentangle, and Control: Controllable Speech-Driven Talking Head Generation
https://arxiv.org/pdf/2405.07257
7 stars 0 forks source link

Feat/progressive training resolution progression [64, 128, 256, 512] #2

Closed johndpope closed 4 months ago

johndpope commented 4 months ago

WIP debug_step_2000_resolution_64