lhotse-speech / lhotse

Tools for handling speech data in machine learning projects.
https://lhotse.readthedocs.io/en/latest/
Apache License 2.0

`AudioBackend` specific `save_audio` and `info`, managing missing SoX in torchaudio, Python 3.12 / PyTorch 2.2 support, using `libsndfile` as preferred audio backend #1288

Closed: pzelasko closed this 7 months ago

pzelasko commented 7 months ago

What was intended as a fix for some edge case turned into a larger refactoring, so I'll release a new lhotse version once this is merged.

pzelasko commented 7 months ago

The failed tests are known to be flaky. In addition to the unit tests, I tested these changes on actual data with different backends and formats. Merging.