lhotse-speech/lhotse
Tools for handling speech data in machine learning projects.
https://lhotse.readthedocs.io/en/latest/
Apache License 2.0
updating the voxpopuli recipe
#1243
Closed
KarelVesely84 closed 9 months ago
KarelVesely84 commented 9 months ago
allow using pre-downloaded transcripts (so data preparation can be run without Internet access)
the transcripts .tgz is downloaded into /manifests, and not into tmp
search for the audio data in the folder: corpus_dir / "raw_audios" / lang
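The behaviour described in the points above could look roughly like the following. This is only a sketch, not the code from this PR: `locate_transcripts`, `audio_dir_for`, the `download` callback and the `archive_name` argument are hypothetical names made up for illustration, and the manifests directory is passed in as a parameter because its full path is not given here.

```python
from pathlib import Path
from typing import Callable, Optional


def locate_transcripts(
    manifests_dir: Path,
    archive_name: str,
    download: Optional[Callable[[Path], None]] = None,
) -> Path:
    """Return the transcripts archive path, downloading only if it is missing.

    Hypothetical helper: `download` is any caller-supplied function that
    writes the archive to the path it receives.
    """
    archive = manifests_dir / archive_name
    if archive.is_file():
        # A pre-downloaded copy exists, so no Internet access is needed.
        return archive
    if download is None:
        raise FileNotFoundError(
            f"{archive} is missing and no downloader was given"
        )
    # Keep the archive in the manifests directory rather than a temp dir,
    # so that later (offline) runs can find and reuse it.
    manifests_dir.mkdir(parents=True, exist_ok=True)
    download(archive)
    return archive


def audio_dir_for(corpus_dir: Path, lang: str) -> Path:
    """Folder searched for audio data: corpus_dir / "raw_audios" / lang."""
    return corpus_dir / "raw_audios" / lang
```

With the archive stored in a persistent directory, a second run finds the file and skips the download, which is what makes offline data preparation possible.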