-
Hi,
Tried executing your project ! Able to run all the python files without any errors, but how can I create a model for the same ? And no Model folder is generated inside the current working direc…
-
On aimerait beaucoup utiliser la fonction de conversion d'un Speaker setup A, en entrée, vers un Speaker setup B, en sortie, rendu possible en mode standalone grâce au Player de SpatGRIS, mais avec de…
led78 updated
1 month ago
-
При использовании библиотеки GnuTLS вместо OpenSSL и версии TLS 1.3 происходит разрыв соедиения с файлообмеником files.catbox.moe.
Zapret запущен непосредственно на компьютере. Конфиг zapret следую…
-
with the same wavenet model and the same utterence(p225_001.wav), i found that the quality of the waveform generated from the mel-spectrogram in provided metadata.pkl is much better than the one gener…
nkcdy updated
4 years ago
-
Hello! We use this mod on my server, and it adds a lot of great features, the speaker block being one of those. However, knowing how easily things can get out-of-control on a public server, we also us…
-
Darization runs very slowly, uses almost 12gb of memory, and is seemingly not happening on the GPU (GPUz and Window's task manager show conflicting info)
- Latest WhisperX repo
- pyannote.audio 3…
-
Hi Tan, I am working on my thesis on TTS, I found this is amazing, I am going to use speaker embeddings as one of feature for my experiment.
Do you have avaiable Checkpoints for Vietnamese, can you s…
-
I managed to use PCM 16 bit Voice data, Sampling rate is 44100Hz.
The recognition result always get true even the completely different sound data.
Sent with GitHawk
-
首先感谢你们团队对业界的贡献!
我已经使用erea2net_para模型在language identification任务上训练完了模型,测试集ACC也有95%.
目前打算将模型部署成一个API服务,能够接收音频文件返回语种结果,请问有什么建议吗?
另外,已经根据speakerlab/bin/下的脚本输出onnx文件,获得了extract_speaker_embedding、make_…
-
is this group still active ?