issues
search
lhotse-speech
/
lhotse
Tools for handling speech data in machine learning projects.
https://lhotse.readthedocs.io/en/latest/
Apache License 2.0
904
stars
204
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Duplicate manifest id after mixing
#1265
MarcoMultichannel
closed
5 months ago
4
Presumed root dir '/scratch' in LibriMix recipe, unable to change?
#1264
frdysf
closed
5 months ago
1
Added handlings for negative end time (#1203)
#1263
ArthLeu
opened
5 months ago
2
How to write a custom CutTransform ?
#1262
kobenaxie
closed
5 months ago
2
Fix non-deterministic tests
#1261
pzelasko
closed
5 months ago
0
support whisper large v3; deepspeed launcher rank world_size setting
#1260
yuekaizhang
closed
5 months ago
0
FileNotFoundError: [Errno 2] Unable to synchronously open file
#1259
Mahaotian1
opened
5 months ago
3
Computing fbanks fails in ami and libricss recipes because data/fbank folder is not created beforehand
#1258
kfmn
closed
5 months ago
0
Wrong url in isci recipe
#1257
kfmn
opened
5 months ago
2
how to delete some monocuts in cutsets?
#1256
Mahaotian1
closed
6 months ago
2
Support resampling and `CutSet.save_audios` when torchaudio is missing
#1255
pzelasko
closed
6 months ago
0
Audio range out of (-1,+1)
#1254
KarelVesely84
opened
6 months ago
2
Allow creating dummy test data without torchaudio
#1253
pzelasko
closed
6 months ago
0
Update docs with env vars used by Lhotse
#1252
pzelasko
closed
6 months ago
0
Bump dev version to 1.20.0
#1251
pzelasko
closed
6 months ago
0
The number of declared samples in the recording diverged from the one obtained when loading audio
#1250
wwwei1997
closed
6 months ago
2
Fix `normalize_loudness` for MixedCuts with PaddingCuts
#1249
pzelasko
closed
6 months ago
0
Support multiplexing with a limited number of open streams
#1248
pzelasko
closed
6 months ago
1
Allowing downloading Edin. ver. of VCTK
#1247
JinZr
closed
6 months ago
0
`CutSampler.map()` for transforming `CutSet` mini-batches
#1246
pzelasko
closed
6 months ago
0
Drop python3.7 support
#1245
pzelasko
closed
6 months ago
0
Perform CutSet.mix() lazily
#1244
pzelasko
closed
6 months ago
0
updating the voxpopuli recipe
#1243
KarelVesely84
closed
6 months ago
0
Update mgb2.py
#1242
Shymaa2611
closed
4 months ago
1
Add dataset for audio tagging
#1241
marcoyang1998
closed
3 months ago
1
Problem with CutSet.from_manifests
#1240
juliendespres
opened
6 months ago
6
Default encoding change
#1239
zzasdf
opened
6 months ago
3
Support for OPUS encoding in Lhotse Shar format
#1238
pzelasko
closed
6 months ago
0
Micro-optimization for LazyJsonlIterator len()
#1237
pzelasko
closed
6 months ago
0
Trained on different datasets but with the same checkpoint model
#1236
OswaldoBornemann
closed
6 months ago
4
support icmc eval track 1
#1235
yuekaizhang
closed
6 months ago
0
Bump dev version to 1.19.0
#1234
pzelasko
closed
7 months ago
0
Does lhotse support chunk-wise dataset/dataloader?
#1233
JinZr
closed
7 months ago
2
Use `attacut` module for Thai word tokenization (in MMS forced alignment)
#1232
flyingleafe
closed
7 months ago
0
Allow lhotse installation without torchaudio for a limited set of features
#1231
pzelasko
closed
7 months ago
1
Question about Recommended Method for Using Alignments
#1230
teowenshen
closed
6 months ago
1
Sampling rate conversion
#1229
bsshruthi22
opened
7 months ago
3
Computing features on truncated files
#1228
ezerhouni
closed
6 months ago
2
Updating lhotse caused some errors when reading data
#1227
lucy9527
opened
7 months ago
2
Sample length exception should maybe be using DurationMismatchError?
#1225
RuABraun
closed
7 months ago
2
[BUG] Deadloop on `LazyRepeater` for non re-iterable.
#1222
chenjiasheng
opened
7 months ago
2
save sdm files into a single mdm file to do gss
#1221
yuekaizhang
closed
7 months ago
2
More flexible setting of audio backends
#1219
pzelasko
closed
7 months ago
0
Query on Extracting Features for Specific Intervals from Recordings
#1218
sangeet2020
closed
7 months ago
2
RFC - Python 3.7 end of life and Python 3.8 new syntax features
#1217
pzelasko
closed
6 months ago
3
Fix audio backend selection
#1216
pzelasko
closed
7 months ago
0
Feature extraction is slow because of slow job submittal
#1215
RuABraun
closed
7 months ago
2
CLI to estimate and print bucket bins for a cut set
#1214
pzelasko
closed
7 months ago
0
Fixed text normalization for `tal_csasr`
#1213
JinZr
closed
7 months ago
0
Add recipe for Medical Corpus
#1212
yfyeung
closed
7 months ago
0
Previous
Next