facebookresearch/voxpopuli
A large-scale multilingual speech corpus for representation learning, semi-supervised learning and interpretation
Other
510 stars, 50 forks
issues
#45 Get the decoded text for segmenting the labeled data (AvivNavon, opened 1 year ago, 0 comments)
#44 Why are some segments marked as invalid? (MartinKocour, opened 1 year ago, 0 comments)
#43 Download scripts do not work with pytorchaudio>=2.0.0 (Whyki, opened 1 year ago, 3 comments)
#42 Update requirements.txt (raivisdejus, opened 1 year ago, 2 comments)
#41 Wrong segmentation of data in the Italian dataset (giampierosalvi, opened 1 year ago, 0 comments)
#40 CVE-2007-4559 Patch (TrellixVulnTeam, opened 1 year ago, 0 comments)
#39 without train code (piaohe111, opened 2 years ago, 0 comments)
#38 Issue getting English accented data (bethant9, opened 2 years ago, 0 comments)
#37 S2S data prepare problem (LucasWangZH, closed 2 years ago, 1 comment)
#36 Croatian ASR data missing half of raw transcripts and characters with diacritics (nljubesi, opened 2 years ago, 1 comment)
#35 Add Accented VoxPopuli dataset (JadeCopet, closed 2 years ago, 3 comments)
#34 Facing problem during downloading dataset (Lalaramarya, opened 2 years ago, 7 comments)
#33 size of the french labeled dataset in voxpopuli (wiamfa, closed 2 years ago, 2 comments)
#32 cannot import name 'LANGUAGES' from 'voxpopuli' (wiamfa, closed 2 years ago, 2 comments)
#31 Cannot download ASR models for Et and Lt (mthrok, opened 3 years ago, 1 comment)
#30 Same speaker_id appear in multiple languages (weedwind, opened 3 years ago, 0 comments)
#29 Trouble loading pretrained checkpoints (joklaff, opened 3 years ago, 0 comments)
#28 WER mismatch in EN ASR model (ankitapasad, opened 3 years ago, 0 comments)
#27 Add v2 data (kahne, closed 3 years ago, 0 comments)
#26 get_asr_data.py can't open output file (dpoljak, closed 3 years ago, 3 comments)
#25 get_asr_data script fails (Robotuks, closed 3 years ago, 3 comments)
#24 Add gender column to ASR manifests (kahne, closed 3 years ago, 0 comments)
#23 DOC: pointers to the wav2letter implementation (Molugan, closed 3 years ago, 0 comments)
#22 403 error when downloading da 2019 (jfainberg, closed 3 years ago, 2 comments)
#21 Add self-trained ASR and speech-to-text translation models (kahne, closed 3 years ago, 1 comment)
#20 Adding VoxPopuli ASR models (kahne, closed 3 years ago, 2 comments)
#19 release LMs (an918tw, closed 3 years ago, 0 comments)
#18 Question about Speaker ID Label (S-GH, closed 3 years ago, 2 comments)
#17 Is there speaker annotations in unlabeled data? (DongChanS, closed 3 years ago, 2 comments)
#16 europarl tools missing while running voxpopuli.get_lm_data (cyrta, closed 3 years ago, 1 comment)
#15 Error Parsing Data (snakers4, closed 3 years ago, 3 comments)
#14 Bugfix cut from labels (Molugan, closed 3 years ago, 0 comments)
#13 BUGFIX: updating the file architecture (Molugan, closed 3 years ago, 0 comments)
#12 LM training update (an918tw, closed 3 years ago, 0 comments)
#11 Update README; add/update data download/processing scripts (kahne, closed 3 years ago, 1 comment)
#10 text normalization script for LM data preparation (an918tw, closed 3 years ago, 0 comments)
#9 [DOC][ASR]: Links to retrieve the data for the PER experiments (Molugan, closed 3 years ago, 0 comments)
#8 [FEATURE][ASR]: Wav2letter doc and cpp (Molugan, closed 3 years ago, 1 comment)
#7 [DOC] Wav2letter checkpoint (Molugan, closed 3 years ago, 1 comment)
#6 [FEATURE][SEGMENTATION] : Segment the data using a .tsv file (Molugan, closed 3 years ago, 0 comments)
#5 [FEATURE][SEGMENTATION] : Cut unlabelled data (Molugan, closed 3 years ago, 0 comments)
#4 [FEATURE][SEGMENTATION]: cut labelled data (Molugan, closed 3 years ago, 0 comments)
#3 [FEATURE][SEGMENTATION] : Stage 1 - run_pyannote_sd (Molugan, closed 3 years ago, 0 comments)
#2 Adding Code of Conduct file (facebook-github-bot, closed 3 years ago, 0 comments)
#1 Adding Contributing file (facebook-github-bot, closed 3 years ago, 0 comments)