JusperLee / CTCNet

An Audio-Visual Speech Separation Model Inspired by Cortico-Thalamo-Cortical Circuits
Apache License 2.0
68 stars 16 forks source link

AVSpeech Dataset #4

Open SutirthaChakraborty opened 8 months ago

SutirthaChakraborty commented 8 months ago

Hi, I have downloaded videos from the AVSpeech Datasets, how can I preprocess that to train this model ?

JusperLee commented 8 months ago

You can use a script like this one to complete the creation of the data: https://github.com/JusperLee/LRS3-For-Speech-Separation

SutirthaChakraborty commented 7 months ago

Do you have any alternative for Baidu Driver? I tried to download the dataset many times to download it. Hard to understand.