issues
search
jishengpeng
/
WavTokenizer
SOTA discrete acoustic codec models with 40 tokens per second for audio language modeling
MIT License
650
stars
34
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Maximum duration supported during inference?
#31
LiuShixing
opened
18 hours ago
0
How many hours of Chinese data are there?
#30
LiuShixing
closed
2 days ago
1
Usage for speech separation and temporal audio features
#29
saveriyo
opened
3 days ago
4
Traning on wenetspeech couldn‘t converge
#28
dyyoungg
opened
3 days ago
2
Comparison with Whisper
#27
isruihu
opened
3 days ago
1
support lightning 2.x or above
#26
nukes
opened
4 days ago
0
What is the difference between the config for training WavTokenizer-small and WavTokenizer-large?
#25
handsomelys
opened
4 days ago
2
Fix DAC training
#24
erogol
opened
5 days ago
0
WavTokenizer-mdium is release on 2024.09.09
#23
jishengpeng
opened
5 days ago
4
Alignment language vocabulary and speech space
#22
varfolomeeff
opened
1 week ago
1
Future 48kHz model
#21
Ronsor
opened
1 week ago
1
The loss value when the model converges
#20
yangyyt
opened
1 week ago
10
encounter shape inconsistent in training 16kHz
#19
dyyoungg
closed
1 day ago
3
Mel or wav?
#18
howitry
opened
1 week ago
1
Purpose of os.environ['CUDA_LAUNCH_BLOCKING'] = '1' in train.py
#17
seastar105
closed
1 week ago
1
Weight of model
#16
JoyceMind
opened
1 week ago
1
Please consider about 16K model?
#15
ywh-my
opened
1 week ago
1
Upgrade to Pytorch Lightning 2.0+ and make pip installable
#14
saveriyo
opened
1 week ago
2
About infer in GPU
#13
JohnFengNeumann
opened
1 week ago
1
fail to install
#12
JoyceMind
opened
1 week ago
2
Installable package
#11
Tomiinek
opened
1 week ago
0
MRD vs MS-STFTD
#10
Yagelmx
opened
1 week ago
4
Convert to package and add libritts data prep script
#9
saveriyo
closed
1 week ago
2
encode and decode for "16k sample"
#8
sunnnnnnnny
closed
1 week ago
1
Some notes on HF integration
#7
NielsRogge
opened
1 week ago
1
About ASR
#6
wntg
opened
1 week ago
4
Quality on lower bandwidth?
#5
OnceJune
opened
1 week ago
3
Abnormal audio exists in generated audio
#4
xinkez
closed
1 week ago
2
Question about infer
#3
handsomelys
opened
2 weeks ago
4
Questions about more detailed experimental results
#2
hbwu-ntu
opened
2 weeks ago
2
Release of the bigger models :)
#1
christophschuhmann
opened
2 weeks ago
1