Open schxnhxlz opened 1 month ago
I did the same thing with deepspeech and I got the same result as you.
Did you try it with deepspeech as well? I had the same issue there :/
Did you process the data with hubert before that, just like:
CUDA_VISIBLE_DEVICES=3 python data_utils/process.py data/
Yes. Still the same issue. I'm now trying to train longer.
Check the sampling rate of your audio file. Mine was 48000 Hz. I converted it to 16000 Hz and it worked.
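In case it helps anyone else hitting this, here is a rough sketch of how the check could be done before extracting features. It reads the rate with Python's standard `wave` module and builds an ffmpeg command that resamples to 16000 Hz. The helper names (`wav_sample_rate`, `needs_resampling`, `ffmpeg_resample_command`) and the file paths are made up for illustration and are not part of the repo; it also assumes a plain PCM WAV file and that ffmpeg is installed.

```python
import wave

# 16000 Hz is the rate reported to work in this thread.
TARGET_RATE = 16000


def wav_sample_rate(path):
    """Return the sampling rate in Hz of a PCM WAV file."""
    with wave.open(path, "rb") as wav_file:
        return wav_file.getframerate()


def needs_resampling(path, rate=TARGET_RATE):
    """True if the file's sampling rate differs from the target rate."""
    return wav_sample_rate(path) != rate


def ffmpeg_resample_command(src, dst, rate=TARGET_RATE):
    """Build (but do not run) an ffmpeg command that resamples src into dst.

    -y overwrites dst if it exists, -ar sets the output audio sampling rate.
    """
    return ["ffmpeg", "-y", "-i", src, "-ar", str(rate), dst]
```

To actually convert, pass the returned list to `subprocess.run(cmd, check=True)`, then run the hubert feature extraction again on the converted file.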
Hi there,
I trained on a video with hubert. Everything looks good so far. But when I try to run inference with my own audio, converted to .npy with
python data_utils/hubert.py --wav data/<name>.wav # save to data/<name>_hu.npy
it creates a 1-minute video without audio and with wrong lip movements. Is there anything I missed here? Cheers