Hangz-nju-cuhk / Talking-Face_PC-AVS

Code for Pose-Controllable Talking Face Generation by Implicitly Modularized Audio-Visual Representation (CVPR 2021)
Creative Commons Attribution 4.0 International
916 stars 169 forks source link

Why start from 2? #61

Open 9B8DY6 opened 2 years ago

9B8DY6 commented 2 years ago

self.target_frame_inds = np.arange(2, len(self.spectrogram) // self.audio.num_bins_per_frame - 2) In voxtest_dataset.py L107, target frame index starts from 2. But in the paper, it starts from 1 (it means start from 0 in python). I don't understand why it is. Thanks.