tencent-ailab / pika

a lightweight speech processing toolkit based on Pytorch and (Py)Kaldi
Apache License 2.0
338 stars 57 forks source link

Suggestion RE data prep #2

Closed danpovey closed 3 years ago

danpovey commented 3 years ago

If you use lhotse for data-preparation (https://github.com/lhotse-speech/lhotse) dependency management may be less painful; it's designed for easy use from Python.

cweng6 commented 3 years ago

Thank you, Dan. We will look at lhotse to see if it'll make the pipeline cleaner.