Tuning positional encoding period when adding more speaker data - Githubissues

EvelynFan / FaceFormer

[CVPR 2022] FaceFormer: Speech-Driven 3D Facial Animation with Transformers

MIT License

790 stars 134 forks source link

Tuning positional encoding period when adding more speaker data #89

Open khalidhnv opened 1 year ago

khalidhnv commented 1 year ago

First of all, thanks for the great work!

I've been generating my own data (8 speakers in different language) and training together with VOCASET (8 speakers in English). Since the period hyper-parameter for positional encoding is related with the speakers, I was wondering

if you need to tune period hyper-parameter with 16 speakers
if you have any recommendation for tuning method

Thanks in advance!