lhotse-speech / lhotse

Tools for handling speech data in machine learning projects.
https://lhotse.readthedocs.io/en/latest/
Apache License 2.0
956 stars 219 forks source link

[Recipe] Spatial LibriSpeech #1386

Closed JinZr closed 3 months ago

JinZr commented 3 months ago

Support: Spatial LibriSpeech

Spatial LibriSpeech, is a spatial audio dataset with over 650 hours of first-order ambisonics, and optional distractor noise (with raw 19-channel audio coming soon). Spatial LibriSpeech is designed for machine learning model training, and it includes labels for source position, speaking direction, room acoustics and geometry. Spatial LibriSpeech was generated by augmenting LibriSpeech samples with 200k+ simulated acoustic conditions across 8k+ synthetic rooms.

JinZr commented 3 months ago

thank you for reviewing!

the concerns have been resolved now

Best Regards Jin

On Wed, 14 Aug 2024 at 10:32 Piotr Żelasko @.***> wrote:

@.**** requested changes on this pull request.

Thanks for the contribution! I left a couple of comments.

— Reply to this email directly, view it on GitHub https://github.com/lhotse-speech/lhotse/pull/1386#pullrequestreview-2237050622, or unsubscribe https://github.com/notifications/unsubscribe-auth/AOON42ES5RP7JLIXX2JSF3DZRK6U3AVCNFSM6AAAAABMJ5RNIOVHI2DSMVQWIX3LMV43YUDVNRWFEZLROVSXG5CSMV3GSZLXHMZDEMZXGA2TANRSGI . You are receiving this because you authored the thread.Message ID: @.***>