DinoMan / speech-driven-animation

Is it just me? #67

Closed illtellyoulater closed 2 years ago

illtellyoulater commented 2 years ago

I know this is all open source, but... don't you guys think the result quality here is far from being even remotely usable? Or is it just me? I've just tried making some videos using the provided models, and well... what's the point if not even the example image can produce a good result?

DinoMan commented 2 years ago

Hi, the example should work just fine. How are you running the code? The library should work well on all examples from the GRID, CREMA-D and TCD TIMIT datasets when used with the corresponding pre-trained models. Those models will not generalize to unseen faces in the wild, however, because they were trained on datasets with too few identities. We have not released the model trained on LRW, which can generalize to any face. For the example (which is from the GRID dataset), you should use the pre-trained model for GRID.
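
To make the dataset-to-model pairing concrete, here is a minimal sketch of a lookup that picks the pre-trained model matching the dataset an input face comes from. The helper name `pick_pretrained_model` and the identifiers `"grid"`, `"crema"` and `"timit"` are assumptions made for illustration; check the project's README for the names the library actually expects and for how to pass them in.

```python
# Hypothetical helper, not part of the library. The model identifiers on the
# right-hand side are assumed names used only for this illustration.
PRETRAINED_BY_DATASET = {
    "GRID": "grid",
    "CREMA-D": "crema",
    "TCD TIMIT": "timit",
}


def pick_pretrained_model(dataset):
    """Return the assumed model identifier for a dataset name.

    Returns None when no released model matches, for example a face
    "in the wild" or a dataset such as LRW whose model is not released.
    """
    key = dataset.strip().upper().replace("_", " ")
    return PRETRAINED_BY_DATASET.get(key)


# The bundled example image is from GRID, so the GRID model is the one to use.
model = pick_pretrained_model("GRID")
if model is None:
    print("No matching pre-trained model; expect poor results on this face.")
else:
    print(f"Use the pre-trained model: {model}")
```

A `None` result corresponds to the case described above: the released models are tied to the identities in their training sets, so an unmatched face is likely to give poor output.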