egorsmkv/asr-cc
Automatic Speech Recognition Corpus Creator
Apache License 2.0
Pipeline
#6
Open
egorsmkv
opened
5 months ago
The project ingests links to YouTube videos.
A queue then downloads the audio track of each video.
Each audio file is split into chunks by voice activity detection (VAD).
The chunks are then pseudo-labeled by the configured ASR backend.
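
The four steps above could be wired together roughly as follows. This is only a sketch to make the data flow concrete, not the project's actual code: `Chunk`, `split_by_energy`, `run_pipeline`, and the `download_audio` and `transcribe` callables are all hypothetical names, and the energy threshold is a toy stand-in for a real VAD model.

```python
from collections import deque
from dataclasses import dataclass
from typing import Callable, Iterable, List, Sequence, Tuple


@dataclass
class Chunk:
    source_url: str
    start: int      # index of the first sample (inclusive)
    end: int        # index one past the last sample
    text: str = ""  # pseudo-label produced by the ASR backend


def split_by_energy(
    samples: Sequence[float], frame_len: int = 4, threshold: float = 0.1
) -> List[Tuple[int, int]]:
    """Toy VAD (assumption): merge consecutive frames whose mean absolute
    amplitude exceeds the threshold into (start, end) sample ranges."""
    regions: List[Tuple[int, int]] = []
    start = None
    for i in range(0, len(samples), frame_len):
        frame = samples[i:i + frame_len]
        voiced = sum(abs(s) for s in frame) / len(frame) > threshold
        if voiced and start is None:
            start = i                    # a voiced region begins
        elif not voiced and start is not None:
            regions.append((start, i))   # the voiced region ends
            start = None
    if start is not None:                # audio ended while still voiced
        regions.append((start, len(samples)))
    return regions


def run_pipeline(
    urls: Iterable[str],
    download_audio: Callable[[str], Sequence[float]],  # hypothetical downloader
    transcribe: Callable[[Sequence[float]], str],      # hypothetical ASR backend
) -> List[Chunk]:
    queue = deque(urls)                  # step 1: ingest links
    chunks: List[Chunk] = []
    while queue:
        url = queue.popleft()
        samples = download_audio(url)    # step 2: fetch the audio track
        for start, end in split_by_energy(samples):    # step 3: VAD split
            text = transcribe(samples[start:end])      # step 4: pseudo-label
            chunks.append(Chunk(url, start, end, text))
    return chunks
```

Passing the downloader and the ASR backend in as plain callables keeps the backend configurable, which matches the "configured ASR backend" wording; a real setup would replace the in-memory queue and the energy threshold with a persistent job queue and a proper VAD.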