-
https://nidwbin.xyz/post/42313/
As long as there is life, the tinkering never stops!
-
A catalog of open-source datasets and crawler sources, continuously updated... Additions and edits are welcome.
-
for the 16kHz Codec model: the bitrate is 2kbps;
for the 44.1kHz Codec model: the bitrate is 6.89kbps;
for the 48kHz Codec model: the bitrate is 7.5kbps;
#1. Here is the exps/results.txt
Codec SU…
-
# Task Name
Japanese Pitch Accent Word Recognition
## Task Objective
This task aims to recognize words in Japanese audio that have different meanings based on pitch accent. Japanese pitch accent …
-
Hi, thanks for the repository!
I have 2 questions:
1. Is there any specific reason why kaldi-fbanks were used to train the model and run inference? Did you try other implementations, such as the one in torchaudio itself …
-
Hi, sir. I don't think the perm and unperm actions in your code make any difference, because the perm action is along the batch dim: in the forward process, the different data along the batch d…
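
A minimal sketch of the point being raised, assuming the forward pass treats every batch item independently. The repository's code is not shown here, so `per_sample_op`, `forward`, and `perm_unperm_forward` are hypothetical stand-ins rather than the author's functions. Under that assumption, shuffling along the batch dimension, running the forward pass, and undoing the shuffle gives the same output as running the forward pass directly.

```python
import random


def per_sample_op(sample):
    # Hypothetical stand-in for any computation applied to one batch item
    # without looking at the other items in the batch.
    return [2.0 * v + 1.0 for v in sample]


def forward(batch):
    # Assumed forward pass: each batch item is processed independently.
    return [per_sample_op(sample) for sample in batch]


def perm_unperm_forward(batch, rng):
    # "perm": shuffle the items along the batch dimension.
    perm = list(range(len(batch)))
    rng.shuffle(perm)
    shuffled = [batch[i] for i in perm]

    out_shuffled = forward(shuffled)

    # "unperm": position k of the shuffled output belongs to original index perm[k].
    out = [None] * len(batch)
    for k, i in enumerate(perm):
        out[i] = out_shuffled[k]
    return out


rng = random.Random(0)
batch = [[rng.random() for _ in range(3)] for _ in range(4)]
same = perm_unperm_forward(batch, rng) == forward(batch)
print(same)  # True
```

The sketch only covers per-item computation; it says nothing about a model whose layers depend on the order of items within the batch.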
-
Using a DataParallel model to start multi-GPU training and changing the batch size in config.yaml does not speed up training.
-
This might be a silly question, so I will begin with an apology.
I am new to speaker verification and I am trying to apply this repo to VoxCeleb1.
Data loading and the other parts seem trivial, but I have…
-
## Intro
- We need to create a space for learning and sharing in a guided and organized way.
- Our goal is to benefit as many people as possible and create a continuous collaboration between everyone to gro…