issues
search
lhotse-speech
/
lhotse
Tools for handling speech data in machine learning projects.
https://lhotse.readthedocs.io/en/latest/
Apache License 2.0
956
stars
219
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Option to save audio in the original format when exporting to shar
#1422
anteju
opened
17 hours ago
0
File reading IO refactoring into backends
#1421
pzelasko
opened
1 day ago
0
UnicodeDecodeError: 'ascii' codec can't decode byte 0xc3 in position 4708: ordinal not in range(128)
#1420
njellinas
closed
1 day ago
2
change max_frames to max_duration in docs
#1419
pengzhendong
opened
2 days ago
1
minor fix
#1418
pengzhendong
closed
3 days ago
0
How to save a cut with the `to_mono` operation?
#1417
pengzhendong
closed
3 days ago
1
[Bug] The total number of supervisions decreased after trimming to supervision groups.
#1416
MartinKocour
opened
1 week ago
1
`concat_cuts` function does not concatenate text field for new cut
#1415
yfyeung
closed
1 week ago
2
Update lhotse.py
#1414
pengzhendong
closed
2 weeks ago
1
Features not including original recording_id when computing
#1411
njellinas
closed
3 weeks ago
2
[fix] fisher_english recipe
#1410
pengzhendong
closed
1 month ago
0
downgrading sphinx version from 7.2.6 to 7.1.2
#1409
annapovey
closed
1 month ago
0
Got Error when use Lhotse Shar data format for multi-GPU K2 model training
#1408
pengyizhou
opened
1 month ago
2
Storage of integer features + Feature extraction best practice
#1407
njellinas
closed
3 days ago
7
Add workflow: annotate DNSMOS P.835
#1406
yfyeung
closed
3 weeks ago
0
Installation problem python 3.8 and sphinx==7.2.6;
#1405
npovey
closed
3 weeks ago
2
Add the Emilia corpus
#1404
csukuangfj
closed
1 month ago
0
Lhotse Manifest Preparation Stuck and Incomplete for MLS English Train Set
#1403
mubtasimahasan
opened
1 month ago
2
Fleurs
#1402
m-wiesner
closed
1 month ago
2
how to set sampler when resume training?
#1401
TinaChen95
opened
1 month ago
0
Adds radio data recipe
#1400
m-wiesner
closed
1 month ago
2
On a large GPU cluster, DynamicBucketingSampler.__next__ spend a lot of time
#1399
shushanxingzhe
opened
1 month ago
1
Implement conversion from CutSet to HuggingFace dataset
#1398
domklement
closed
1 month ago
1
Bug in CutPairsSampler with CutSet.from_files(): Fails to raise StopIteration at the end of dataset iteration, raises `AttributeError: 'tuple' object has no attribute 'subset'`
#1396
Aijohc
opened
2 months ago
3
Add recipe for the Santa Barbara Corpus of Spoken American English (SBCSAE)
#1395
mmaciej2
closed
1 month ago
2
Fix ksponspeech recipe
#1394
yfyeung
closed
1 month ago
0
Fix cli for ksponspeech
#1393
yfyeung
closed
1 month ago
0
Fix backend to None while ffmpeg is unavailable.
#1392
pengzhendong
closed
1 month ago
3
spgi duration interprocess crash fix
#1391
pclpp
closed
2 months ago
1
[spgispeech] Fix durations object is null issue
#1390
frankyoujian
closed
2 months ago
1
AttributeError: 'NoneType' object has no attribute 'data'
#1389
Airgods
opened
2 months ago
6
Unknown manifest type error for `jsonl.gz` manifests
#1388
muradbozik
closed
1 month ago
2
Fix to fixed batch size bucketing and audio loading network connectio…
#1387
pzelasko
closed
3 months ago
0
[Recipe] Spatial LibriSpeech
#1386
JinZr
closed
3 months ago
1
add a param tolerance of cut.py simple
#1385
yunxinmengze
opened
3 months ago
3
[Recipe] Wenetspeech4tts
#1384
yuekaizhang
closed
3 months ago
1
Added has_custom to MixedCut
#1383
anteju
closed
3 months ago
0
Make torchaudio an optional dependency
#1382
pzelasko
closed
2 weeks ago
0
Include a copyright NOTICE listing major copyright holders
#1381
pzelasko
closed
3 months ago
1
`CutSet`.prefetch() for background cuts loading during iteration
#1380
pzelasko
closed
3 months ago
0
Cap the 'trng' random seeds to 2**31 avoiding numpy error
#1379
pzelasko
closed
3 months ago
0
Refactor bucket selection for customization
#1377
pzelasko
closed
4 months ago
0
MUSAN mix to current CutSet: Cannot load audio of cuts in a lazy CutSet.
#1376
njellinas
closed
3 months ago
3
Add EARS recipe
#1375
Ryu1845
closed
4 months ago
1
How to load parquet file effectively with Lhotse?
#1374
kobenaxie
closed
4 months ago
1
Concurrent dynamic bucketing
#1373
pzelasko
closed
4 months ago
0
Support for pre-determined batch sizes in DynamicBucketingSampler
#1372
pzelasko
closed
4 months ago
0
Read seperate .jsonl.gz from fbank filter them and make a Cutset into single variable.
#1371
sanjuktasr
opened
4 months ago
3
Fix MixedCut transforms serialization
#1370
pzelasko
closed
4 months ago
0
How does tar work with DynamicBucketingSampler?
#1368
orena1
closed
4 months ago
0
Next