-
I am implementing a script to extract video-only, audio-only and audio-visual embeddings using a AV-HuBERT checkpoint. Specifically, I am using the one fine-tuned to the AVSR downstream task: ```large…
-
Hi, I did the following steps but got the below error.
step-1: I executed the crop_mouth_from_video.py from the preprocess directory and got the .npz files.
step-2: Executed the script related to v…
-
I run the code on my dataset in Colab but I faced this issue , it is not work with GPU but when I delete all cude in main.py it is work well, I want applied it on GPU because very slowly , so can help…
-
Hi , I‘m a beginner in lipreading. I'm curious how low the latency of lip recognition can be? Is there any solution to reduce the delay?
Thank you very much.
-
Hi, thanks for the code!
I want to do a test with your pretrained model on the LRW dataset.
Firstly, I run the crop_mouth_from_video.py and got many npz under “$TCN_LIPREADING_ROOT/datasets/visu…
-
Hey!
Myself Archisman, GSSoC'23
**Describe the project you want to add with tech stack**
It will be a convolutional neural network that is trained on a large dataset of videos of people speaking.…
-
Hello
So, attempting to use the GPU with a sample video trying to perform lip reading with this simple command:
`python main.py --config-filename configs/LRS3_V_WER32.3.ini --data-filename test_v…
-
Could you provide the `20words_mean_face.npy` you used?
-
Thanks for your great works. At the end of Readme, you pointed out the test command in the LRS3 dataset. But I can not find the script named run_av_hubert.sh. So how do I verify the performance of the…
-
Typo in the visual instructions of 1:40 - 2:05 - "We see Martine having difficulty understanding a page with with long and justified paragraphs," (duplicate "with").
Overall question about Martine'…