lhotse-speech lhotse issues

lhotse-speech / lhotse

Tools for handling speech data in machine learning projects.

https://lhotse.readthedocs.io/en/latest/

Apache License 2.0

956 stars 219 forks source link

issues

Newest

Newest Most commented Recently updated Oldest Least commented Least recently updated

Option to save audio in the original format when exporting to shar

#1422 anteju opened 17 hours ago
0
File reading IO refactoring into backends

#1421 pzelasko opened 1 day ago
0
UnicodeDecodeError: 'ascii' codec can't decode byte 0xc3 in position 4708: ordinal not in range(128)

#1420 njellinas closed 1 day ago
2
change max_frames to max_duration in docs

#1419 pengzhendong opened 2 days ago
1
minor fix

#1418 pengzhendong closed 3 days ago
0
How to save a cut with the `to_mono` operation?

#1417 pengzhendong closed 3 days ago
1
[Bug] The total number of supervisions decreased after trimming to supervision groups.

#1416 MartinKocour opened 1 week ago
1
`concat_cuts` function does not concatenate text field for new cut

#1415 yfyeung closed 1 week ago
2
Update lhotse.py

#1414 pengzhendong closed 2 weeks ago
1
Features not including original recording_id when computing

#1411 njellinas closed 3 weeks ago
2
[fix] fisher_english recipe

#1410 pengzhendong closed 1 month ago
0
downgrading sphinx version from 7.2.6 to 7.1.2

#1409 annapovey closed 1 month ago
0
Got Error when use Lhotse Shar data format for multi-GPU K2 model training

#1408 pengyizhou opened 1 month ago
2
Storage of integer features + Feature extraction best practice

#1407 njellinas closed 3 days ago
7
Add workflow: annotate DNSMOS P.835

#1406 yfyeung closed 3 weeks ago
0
Installation problem python 3.8 and sphinx==7.2.6;

#1405 npovey closed 3 weeks ago
2
Add the Emilia corpus

#1404 csukuangfj closed 1 month ago
0
Lhotse Manifest Preparation Stuck and Incomplete for MLS English Train Set

#1403 mubtasimahasan opened 1 month ago
2
Fleurs

#1402 m-wiesner closed 1 month ago
2
how to set sampler when resume training?

#1401 TinaChen95 opened 1 month ago
0
Adds radio data recipe

#1400 m-wiesner closed 1 month ago
2
On a large GPU cluster, DynamicBucketingSampler.__next__ spend a lot of time

#1399 shushanxingzhe opened 1 month ago
1
Implement conversion from CutSet to HuggingFace dataset

#1398 domklement closed 1 month ago
1
Bug in CutPairsSampler with CutSet.from_files(): Fails to raise StopIteration at the end of dataset iteration, raises `AttributeError: 'tuple' object has no attribute 'subset'`

#1396 Aijohc opened 2 months ago
3
Add recipe for the Santa Barbara Corpus of Spoken American English (SBCSAE)

#1395 mmaciej2 closed 1 month ago
2
Fix ksponspeech recipe

#1394 yfyeung closed 1 month ago
0
Fix cli for ksponspeech

#1393 yfyeung closed 1 month ago
0
Fix backend to None while ffmpeg is unavailable.

#1392 pengzhendong closed 1 month ago
3
spgi duration interprocess crash fix

#1391 pclpp closed 2 months ago
1
[spgispeech] Fix durations object is null issue

#1390 frankyoujian closed 2 months ago
1
AttributeError: 'NoneType' object has no attribute 'data'

#1389 Airgods opened 2 months ago
6
Unknown manifest type error for `jsonl.gz` manifests

#1388 muradbozik closed 1 month ago
2
Fix to fixed batch size bucketing and audio loading network connectio…

#1387 pzelasko closed 3 months ago
0
[Recipe] Spatial LibriSpeech

#1386 JinZr closed 3 months ago
1
add a param tolerance of cut.py simple

#1385 yunxinmengze opened 3 months ago
3
[Recipe] Wenetspeech4tts

#1384 yuekaizhang closed 3 months ago
1
Added has_custom to MixedCut

#1383 anteju closed 3 months ago
0
Make torchaudio an optional dependency

#1382 pzelasko closed 2 weeks ago
0
Include a copyright NOTICE listing major copyright holders

#1381 pzelasko closed 3 months ago
1
`CutSet`.prefetch() for background cuts loading during iteration

#1380 pzelasko closed 3 months ago
0
Cap the 'trng' random seeds to 2**31 avoiding numpy error

#1379 pzelasko closed 3 months ago
0
Refactor bucket selection for customization

#1377 pzelasko closed 4 months ago
0
MUSAN mix to current CutSet: Cannot load audio of cuts in a lazy CutSet.

#1376 njellinas closed 3 months ago
3
Add EARS recipe

#1375 Ryu1845 closed 4 months ago
1
How to load parquet file effectively with Lhotse?

#1374 kobenaxie closed 4 months ago
1
Concurrent dynamic bucketing

#1373 pzelasko closed 4 months ago
0
Support for pre-determined batch sizes in DynamicBucketingSampler

#1372 pzelasko closed 4 months ago
0
Read seperate .jsonl.gz from fbank filter them and make a Cutset into single variable.

#1371 sanjuktasr opened 4 months ago
3
Fix MixedCut transforms serialization

#1370 pzelasko closed 4 months ago
0
How does tar work with DynamicBucketingSampler?

#1368 orena1 closed 4 months ago
0