lhotse-speech / lhotse

Tools for handling speech data in machine learning projects.
https://lhotse.readthedocs.io/en/latest/
Apache License 2.0
929 stars 212 forks source link

Unknown manifest type error for `jsonl.gz` manifests #1388

Open muradbozik opened 2 weeks ago

muradbozik commented 2 weeks ago

I have created a dataset using a custom feature extractor and seved the cutset to file with the extension .jsonl.gz. I can load the manifest in version 1.24.1 perfectly fine. But versions 1.26.0 and 1.27.0 throws the error Unknown manifest type 'jsonl.gz'. I didn't dig the release changes if support was dropped for gzip files or not. Just reporting if this is a problem. I stick with the version 1.24.1 for now.

Feel free to close the issue, if this was expected behaviour. Thanks.

pzelasko commented 2 weeks ago

It’s not expected. Can you share the manifest (or a part of it) to reproduce?