lhotse-speech/lhotse
Tools for handling speech data in machine learning projects.
https://lhotse.readthedocs.io/en/latest/
Apache License 2.0
902 stars, 204 forks
issues
select a random sub-region of the noise based on the delta duration
#1317
osadj
closed
2 months ago
1
Fix randomness in CutMix transform
#1316
pzelasko
closed
3 months ago
0
Enhance `CutSet.mix()` randomness and data utilization
#1315
pzelasko
closed
3 months ago
0
fix limited sampling range
#1314
kamirdin
closed
3 months ago
0
fix the truncation logic in LazyCutMixer
#1313
osadj
closed
3 months ago
0
Cut mixing issues
#1312
osadj
closed
3 months ago
8
extract feature failed
#1311
zhazhuanling12
closed
2 months ago
3
Librimix dataset
#1310
AntoineBlanot
opened
3 months ago
1
More similar mean batch duration across nodes with DynamicBucketingSampler in multi-GPU training
#1309
lifeiteng
closed
4 weeks ago
4
Fix typo in README.md
#1308
yfyeung
closed
3 months ago
0
ASR, ST and CS recipes
#1307
AmirHussein96
opened
3 months ago
1
Fixing recording move to memory
#1306
Tomiinek
opened
3 months ago
0
Updated text_norm for `aishell` recipe
#1305
JinZr
closed
3 months ago
1
Add Chinese TTS dataset `baker`.
#1304
csukuangfj
closed
2 months ago
1
Fix _get_strided_batch device
#1303
lifeiteng
closed
3 months ago
1
MDCC recipe
#1302
JinZr
closed
3 months ago
1
Bump dev version to 1.23.0
#1301
pzelasko
closed
3 months ago
0
Xfail flaky SileroVAD tests
#1300
pzelasko
closed
3 months ago
0
Channel selection for multi-channel custom recording fields
#1299
pzelasko
closed
3 months ago
1
Fix loading multi-channel custom recording fields in multi cuts
#1298
pzelasko
closed
4 months ago
1
Add new recipe: speechio
#1297
yuekaizhang
closed
3 months ago
0
tedlium2 recipe
#1296
JinZr
closed
3 months ago
0
Extending Lhotse dataloading to text/multimodal data
#1295
pzelasko
closed
3 months ago
1
Fix feature_dim of Spectrogram extractors.
#1294
csukuangfj
closed
4 months ago
0
Function to merge short sentences into a long sentence
#1293
yuguochencuc
opened
4 months ago
3
Cutconcat fixed max duration
#1292
swigls
closed
4 months ago
1
Documentation for random seeds in lhotse + extended support of lazy r…
#1291
pzelasko
closed
4 months ago
0
Use audio backends and export custom fields in Lhotse Shar
#1290
pzelasko
closed
4 months ago
0
fix whisper for multi-channel data
#1289
yuekaizhang
closed
3 months ago
1
`AudioBackend` specific `save_audio` and `info`, managing missing SoX in torchaudio, Python 3.12 / PyTorch 2.2 support, using `libsndfile` as preferred audio backend
#1288
pzelasko
closed
4 months ago
1
Handle error with cachedir creation gracefully
#1287
pzelasko
closed
4 months ago
0
How to split long cuts into shorter ones without messing supervisions up?
#1286
mohsen-goodarzi
closed
4 months ago
2
Question about WebDataset
#1285
Ryu1845
closed
4 months ago
2
Fixes for manifest validation and fixing
#1284
pzelasko
closed
4 months ago
1
Download AMI (SDM): HTTP Error 404: Not Found
#1282
AntoineBlanot
closed
2 months ago
18
fix_manifests function cost much time
#1281
xiangxyq
closed
4 months ago
3
Add VAD to Supervisions in LibriLight Recipe
#1280
yfyeung
closed
5 months ago
2
Allow duplicate cut IDs in a CutSet (CutSet is list-like instead of dict-like)
#1279
pzelasko
closed
5 months ago
0
Enable seed randomization in dynamic samplers
#1278
pzelasko
closed
5 months ago
1
Last mini-batch redistribution in distributed samplers
#1277
pzelasko
closed
5 months ago
0
Merge shuffling and bucketing buffers in DynamicBucketingSampler
#1276
pzelasko
closed
5 months ago
2
Install kaldi-native-io explicitly in the kaldi doc example.
#1275
csukuangfj
closed
5 months ago
1
[WIP] Adds preliminary sbcsae (Santa Barbara Corpus of Spoken American English) recipe
#1274
m-wiesner
opened
5 months ago
0
Fluent Speech Commands dataset, SLU task
#1272
HSTEHSTEHSTE
closed
5 months ago
4
Duplicate Manifest ID / Mux
#1271
m-wiesner
opened
5 months ago
1
Fix distributed sampler initialization and `exceeded` sampler warning false positives
#1270
pzelasko
closed
5 months ago
1
Too many warnings about time_constraint.exceeded(), and training stops quite early.
#1269
kobenaxie
closed
5 months ago
7
Fix duplication issues in CutSet.mix()
#1268
pzelasko
closed
5 months ago
0
Question about the returns of CutSet.mix() ?
#1267
kobenaxie
closed
5 months ago
1
Support controllable `CutSet.mux` weights in multiprocess dataloading
#1266
pzelasko
closed
5 months ago
1