-
hello shrubb!
Thanks for sharing your research and code. I research Neural Talking Head based on your works, and it will be helpful if you give me some advice.
I think it will be an extension of…
-
Amazing work! I am trying to reproduce the results of the paper. But i saw two config file for training on voxceleb dataset. I am wondering about the training details since there are not a lot in the …
-
It is a bug but more a clarification that I need to understand the speakerNet model.
I need to do speaker verification using speakerNet. So I follow the tutorial https://github.com/NVIDIA/NeMo/blob/m…
-
Thank you for your job and very detailed explanation!
I have download the voxceleb2 & lrs3, and used your 'generate_TalkSet.py' code and successfully got the TalkSet! But it looks like a difference …
-
In the script of downloading `get_hi-mia_data.py` there is a small bug. I think line `l.output.write_wav(line, y, sr)` in `__process_data ` needs to be changed by `sf.write(line, data=y, samplerate=s…
-
When I ran the train script, I got this error.
File "/opt/voxceleb_trainer-master/utils.py", line 18, in accuracy
correct_k = correct[:k].view(-1).contiguous().float().sum(0, keepdim=True)
…
-
Hi @hbredin , great repo and talk!
I'm trying to perform diarization to detect number of speakers in a short question and answer clip.
Clips are between 3 and 10 seconds on average and I want to ver…
-
Thank you for your great work first! But I have some puzzle about keypoints in this model.
I know keypoints here are motion-specific, and in my understanding, one specific keypoint in different im…
-
when I am running ```drive.py``` as:
```
python3 drive.py outputs/tony_hawk_take_1/checkpoints/model_02715367.pth results/ --destination results/ --images_path DATASET_ROOT/images-cropped/personA/…
-
OK, I'm continuing my work with a language identifier that works with only 3 languages, but right now it's getting stuck in a rut - on epoch 3, trying to obtain results with audios of different langua…