Closed: youngstu closed this issue 1 year ago
We use teacher forcing for efficiency reasons, and we find that the performance is not bad. BTW, we have also tried teacher-forcing training in FaceFormer and found the performance difference to be quite limited.
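For anyone reading along, here is a minimal toy sketch of the efficiency point. It is not code from this repository or from FaceFormer: `step`, `teacher_forcing_rollout`, and `autoregressive_rollout` are hypothetical names, and the "model" is a made-up one-line function. It only illustrates that with teacher forcing each step's input comes from the ground truth, so the steps do not depend on each other and could be computed in one batched pass, while the autoregressive scheme has to run step by step on its own predictions.

```python
def step(prev_frame, audio_feat):
    """Toy one-step decoder (hypothetical): next frame from previous frame and audio."""
    return 0.5 * prev_frame + audio_feat


def teacher_forcing_rollout(audio, ground_truth, start=0.0):
    # Inputs are the ground-truth frames shifted right by one position,
    # so no step depends on the model's own outputs (parallelizable).
    prev_inputs = [start] + ground_truth[:-1]
    return [step(p, a) for p, a in zip(prev_inputs, audio)]


def autoregressive_rollout(audio, start=0.0):
    # Each step consumes the model's previous prediction,
    # so the loop is inherently sequential.
    preds, prev = [], start
    for a in audio:
        prev = step(prev, a)
        preds.append(prev)
    return preds


audio = [1.0, 2.0, 3.0]
ground_truth = [1.0, 2.0, 4.0]

tf = teacher_forcing_rollout(audio, ground_truth)
ar = autoregressive_rollout(audio)
print(tf)  # [1.0, 2.5, 4.0]
print(ar)  # [1.0, 2.5, 4.25]
```

The two rollouts diverge at the last frame because the autoregressive version feeds back its own earlier prediction (2.5) instead of the ground-truth frame (2.0). That train/inference mismatch is the usual argument against teacher forcing, and the sequential loop is the usual cost of avoiding it.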
Got it, thanks.
Why use a teacher-forcing scheme? Hasn't teacher forcing been shown to be worse than the autoregressive scheme in many papers, such as FaceFormer and FaceXHuBERT?