egorsmkv / asr-cc

Automatic Speech Recognition Corpus Creator
Apache License 2.0
0 stars 0 forks source link

Add the VAD service #12

Open egorsmkv opened 4 months ago

egorsmkv commented 4 months ago

It ingests a path to audio file and saves timestamps of speech in the database

egorsmkv commented 4 months ago

Later, an audio file is splitted by timestamps (skipping min/max outliers)