-
Recently, I have been conducting applied research on Target Speaker Extraction, but I have encountered many difficulties. I came across your paper titled 'Generative Speech Foundation Model Pretrainin…
-
It would be great if voice improvement/restauration models could also be integrated into this amazing piece of software.
For example there's this MIT licensed tool called voicefixer: https://github.c…
-
# 说明
PaddleSpeech 是基于飞桨 PaddlePaddle 的语音方向的开源套件,囊括语音识别、语音合成、语音唤醒、声纹识别等多种语音常用功能的支持。由于近期 Paddle 新版本的升级存在不兼容部分(如 paddle.fluid API 全面退场,PIR + predictor 升级, 0-d tensor,view 行为修改等),需要重新对 PaddleSpeech 中的模型进…
-
`👏🏻 2022.05.06: PaddleSpeech Streaming Server is available for Streaming ASR with Punctuation Restoration and Token Timestamp and Text-to-Speech.`
请问为什么本地的ASRExecutor识别没有Timestamp功能?未来是否会添加,谢谢!
-
Hello! Awesome project, and I totally understand that this isn't your main focus anymore, but I just love the results this gives over almost everything else I've tried for speech restoration.
Howev…
-
The most reliable example I could find is `Is that true?` (but not `Is that true.`), where the first word is always omitted. It happens for all voices in this model, regardless of other settings. With…
Sobsz updated
5 months ago
-
GaVaMoE: Gaussian-Variational Gated Mixture of Experts for Explainable Recommendation
https://arxiv.org/abs/2410.11841
-
Hey guys, really cool work! I'm an engineer at [Sieve](http://sievedata.com/) and we've been working with lip-syncing tech for some time now. We were quite impressed by the capabilities of MuseTalk an…
-
Hello everyone!
After training, I save conformer model to SavedModel format by code below:
```python
config = Config(config_path)
speech_featurizer = TFSpeechFeaturizer(config.speech_config)
text…
-
In Android SpeechRecognizer emits (horrible) BIP sound when starting and ending recognition.
This by design but it would be awesome to be able to remove / mute these bips.