
egorsmkv / asr-cc

Automatic Speech Recognition Corpus Creator

Apache License 2.0


Pipeline #6

Open egorsmkv opened 5 months ago

egorsmkv commented 5 months ago

The project ingests links to YouTube videos.
A queue then downloads the audio track of each video.
Each audio file is split into chunks by voice activity detection (VAD).
The chunks are then pseudo-labeled by the configured ASR backend.
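
The four steps above could be wired together roughly as follows. This is only a minimal sketch using the Python standard library, not the project's actual code: `Chunk`, `split_by_energy`, `run_pipeline`, `fake_download`, and `fake_asr` are hypothetical names, the energy threshold is a toy stand-in for a real VAD, and the downloader and ASR backend are passed in as plain callables so that any real implementation can be substituted.

```python
from dataclasses import dataclass
from queue import Queue
from typing import Callable, List, Sequence, Tuple


@dataclass
class Chunk:
    """One pseudo-labeled segment of audio (hypothetical structure)."""
    source_url: str
    start: int       # index of the first sample (inclusive)
    end: int         # index past the last sample (exclusive)
    text: str = ""   # pseudo-label produced by the ASR backend


def split_by_energy(samples: Sequence[float], frame: int = 4,
                    threshold: float = 0.1) -> List[Tuple[int, int]]:
    """Toy stand-in for VAD: return (start, end) ranges covering runs of
    frames whose mean absolute amplitude exceeds `threshold`."""
    spans: List[Tuple[int, int]] = []
    start = None
    for i in range(0, len(samples), frame):
        window = samples[i:i + frame]
        voiced = sum(abs(s) for s in window) / len(window) > threshold
        if voiced and start is None:
            start = i                      # a voiced run begins
        elif not voiced and start is not None:
            spans.append((start, i))       # the voiced run ends
            start = None
    if start is not None:                  # audio ended while voiced
        spans.append((start, len(samples)))
    return spans


def run_pipeline(urls: Sequence[str],
                 download_audio: Callable[[str], Sequence[float]],
                 transcribe: Callable[[Sequence[float]], str]) -> List[Chunk]:
    """Ingest links, download audio via a queue, split it, pseudo-label it."""
    jobs: Queue = Queue()
    for url in urls:                       # step 1: ingest links
        jobs.put(url)

    results: List[Chunk] = []
    while not jobs.empty():
        url = jobs.get()
        samples = download_audio(url)      # step 2: download the audio
        for start, end in split_by_energy(samples):        # step 3: split
            text = transcribe(samples[start:end])          # step 4: label
            results.append(Chunk(url, start, end, text))
        jobs.task_done()
    return results


# Placeholder backends, used only to exercise the sketch.
def fake_download(url: str) -> List[float]:
    return [0.0] * 4 + [0.5] * 8 + [0.0] * 4 + [0.9] * 4


def fake_asr(audio: Sequence[float]) -> str:
    return f"<{len(audio)} samples>"


if __name__ == "__main__":
    for chunk in run_pipeline(["https://example.com/video-1"],
                              fake_download, fake_asr):
        print(chunk)
```

Passing the downloader and the ASR backend as arguments mirrors the "configured ASR backend" wording: swapping backends would only change the callable handed to `run_pipeline`. A real deployment would presumably use a persistent job queue and worker processes rather than an in-memory `Queue`.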