JusperLee / CTCNet

An Audio-Visual Speech Separation Model Inspired by Cortico-Thalamo-Cortical Circuits
Apache License 2.0
70 stars, 17 forks

AVSpeech Dataset #4

Open SutirthaChakraborty opened 10 months ago

SutirthaChakraborty commented 10 months ago

Hi, I have downloaded videos from the AVSpeech dataset. How can I preprocess them to train this model?

JusperLee commented 10 months ago

You can create the training data with a script like this one: https://github.com/JusperLee/LRS3-For-Speech-Separation
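
For readers who want a rough idea of what such data creation involves, here is a minimal sketch of one step that speech-separation datasets typically need: mixing two speakers' audio at a chosen signal-to-noise ratio. This is an assumption about the general approach, not the linked repository's code. The function names `rms` and `mix_at_snr` are hypothetical, the inputs are assumed to be mono float samples at the same sample rate, and extracting audio from the videos or preparing the visual stream is not covered.

```python
import math


def rms(samples):
    """Root-mean-square level of a sequence of float samples."""
    return math.sqrt(sum(s * s for s in samples) / len(samples))


def mix_at_snr(target, interferer, snr_db):
    """Mix two mono signals so the target is snr_db louder than the interferer.

    Hypothetical helper: returns (mixture, scaled_interferer), both
    truncated to the length of the shorter input.
    """
    n = min(len(target), len(interferer))
    if n == 0:
        raise ValueError("both signals must be non-empty")
    target, interferer = target[:n], interferer[:n]

    t_rms, i_rms = rms(target), rms(interferer)
    if t_rms == 0 or i_rms == 0:
        raise ValueError("signals must not be silent")

    # Gain that places the interferer snr_db below the target level.
    gain = t_rms / (i_rms * 10 ** (snr_db / 20))
    scaled = [gain * s for s in interferer]

    # The mixture is the sample-wise sum of the two sources.
    mixture = [a + b for a, b in zip(target, scaled)]
    return mixture, scaled
```

In practice the samples would come from an audio-loading library rather than plain lists, and the truncated target and scaled interferer would be saved alongside the mixture as the separation ground truth.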

SutirthaChakraborty commented 9 months ago

Do you have an alternative to Baidu Drive? I have tried to download the dataset many times, but it is hard to understand.