issues
search
lhotse-speech
/
lhotse
Tools for handling speech data in machine learning projects.
https://lhotse.readthedocs.io/en/latest/
Apache License 2.0
901
stars
204
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
How to combine with huggingface audio datasets?
#1366
yuekaizhang
opened
20 hours ago
0
Add GigaSpeech 2 recipe
#1365
yfyeung
opened
4 days ago
0
Feature calculation process crashing with large dataset
#1364
duhtapioca
opened
6 days ago
1
dataloader slow with shar
#1363
tianchaolangzi
closed
1 week ago
3
Numpy 2.0 compatibility
#1362
pzelasko
closed
1 week ago
0
Utils for discovering attached data and dropping in-memory data
#1361
pzelasko
closed
1 week ago
1
Restoring smart open for local files if available
#1360
pzelasko
closed
1 week ago
1
Support for Video Features, for example How2Sign
#1359
kerolos
opened
1 week ago
1
OSError: [Errno 9] Unable to synchronously open file (unable to lock file, errno = 9, error message = 'Bad file descriptor')
#1358
Mahaotian1
opened
2 weeks ago
1
UnicodeEncodeError: 'ascii' codec can't encode characters in position 505-506: ordinal not in range(128)
#1357
chiiyeh
opened
2 weeks ago
1
Fix Recording.to_dict() when transforms are dicts and transform pickling issues
#1355
pzelasko
closed
1 week ago
1
Support for reading data from AIStore using Python SDK
#1354
pzelasko
closed
3 weeks ago
0
Add KsponSpeech recipe
#1353
whsqkaak
closed
2 weeks ago
2
error in window 11 installation
#1352
xalteropsx
closed
3 weeks ago
1
AttributeError: 'dict' object has no attribute 'to_dict'
#1351
lalimili6
opened
3 weeks ago
2
Multiple feature extractors in a single Cut
#1350
njellinas
opened
3 weeks ago
1
Increase the start diff tolerance for feature loading
#1349
pzelasko
closed
4 weeks ago
0
augmentation/torchaudio: add Phone effect (mulaw, lpc10 codecs)
#1348
rouseabout
opened
1 month ago
4
Fix one-off edge case in split_lazy
#1347
pzelasko
closed
1 month ago
0
'ascii' codec can't encode characters in position 219-247 in processing wenet speech dateset
#1346
wgfi110
closed
1 month ago
1
More test coverage for lhotse subset
#1345
pzelasko
closed
3 weeks ago
0
Add new sampler: weighted sampler
#1344
marcoyang1998
closed
3 weeks ago
2
Fix librispeech manifest caching
#1343
haerski
closed
1 month ago
0
PR #1332 breaks many operations
#1342
JinZr
closed
1 month ago
1
Dynamic bucket selection rng sync
#1341
pzelasko
closed
3 weeks ago
11
Fix describe on cuts
#1340
keeofkoo
closed
1 month ago
1
Describe on cuts does not display supervision custom info
#1339
keeofkoo
closed
1 month ago
0
Create a custom audio transformation
#1338
njellinas
closed
3 weeks ago
1
Pytorch dataloader cannot compute length
#1337
njellinas
closed
3 weeks ago
2
Missing 'subset' parameter
#1336
daniel-dona
closed
1 month ago
0
Use libsndfile in recording chunk dataset
#1335
pzelasko
closed
1 month ago
0
How to split manifests into several parts
#1334
OswaldoBornemann
opened
1 month ago
1
Experiencing memory leakage with MixedCut.load_audio
#1333
hereismohsen
closed
1 month ago
2
`reverb_rir`: support Cut input and in memory data
#1332
pzelasko
closed
1 month ago
0
Is there any possible to use multi GPU to inside the compute_and_store_features_batch function
#1331
OswaldoBornemann
opened
1 month ago
3
Add the ReazonSpeech recipe
#1330
Triplecq
closed
1 month ago
5
Bump dev version to 1.24.0
#1329
pzelasko
closed
2 months ago
0
In CommonVoice corpus, use .tsv headers to parse and not column index
#1328
daniel-dona
closed
2 months ago
3
it takes too long for DynamicBucketingSampler to load state dict
#1327
Mahaotian1
opened
2 months ago
5
A problem occurred while processing WenetSpeech:input tensor must fit into 32-bit index math
#1326
codeking233
opened
2 months ago
1
Common voice wrong metadata added to supervision set
#1325
Roagen7
opened
2 months ago
3
Common Voice download fails with a 403 error
#1324
daniel-dona
opened
2 months ago
2
Fix export of features/array to shar
#1323
pzelasko
closed
2 months ago
0
Fix `trim_to_supervision_groups`
#1322
pzelasko
closed
2 months ago
0
AttributeError: No such attribute: to_eager
#1321
TinaChen95
closed
2 months ago
1
[shar] cut can't load feature
#1320
kamirdin
closed
2 months ago
1
how to do feature extraction for each supervision?
#1319
TinaChen95
opened
2 months ago
1
Allow skipping missing files in AMI download
#1318
pzelasko
closed
2 months ago
0
select a random sub-region of the noise based on the delta duration
#1317
osadj
closed
2 months ago
1
Fix randomness in CutMix transform
#1316
pzelasko
closed
3 months ago
0
Next