-
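Hello, your paper "IMPROVING END-TO-END CONTEXTUAL SPEECH RECOGNITION WITH FINE-GRAINED CONTEXTUAL KNOWLEDGE SELECTION" is very inspiring. We are working on related research and would like to reproduce your method. I noticed that your repository only contains the English dataset; could you share the Chinese dataset? Thank you. Email: lishaojun18@huawei.com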
-
I have the following 5-second audio clip (it's a video because silly GitHub does not support uploading audio; you can extract the audio with `ffmpeg -i short.mp4 -vn short.wav`):
https://github.com/facebo…
-
Dataloader name: `fleurs/fleurs.py`
DataCatalogue: http://seacrowd.github.io/seacrowd-catalogue/card.html?fleurs
| Dataset| fleurs |
|-------------|---|
| Description | Fleurs dataset is a part o…
-
Hi Team,
In a Lex V2 bot, when the input mode is speech and a user says "yes", it is currently accepted as 'its yes', 'yeah sure', or 'okay yes'. We are observing the same behaviour in multiple uttera…
-
This is going to collect missing spaces after a period, as discussed in https://github.com/petergtz/alexa-wikipedia/issues/37.
-
`# TextSplitter configuration. Do not modify it unless you understand what it means.
text_splitter_dict = {
"ChineseRecursiveTextSplitter": {
"source": "huggingface", # choosing tiktoken uses OpenAI's method instead
"tokenizer_name_or_path": "",…
-
**Describe the bug**
When using the speech recognition text function, this error has been reported since April 7th.
This is the error log:
[logspeech.txt](https://github.com/Azure-Samples/cogni…
-
Hello,
I'm trying to use the sherpa-onnx Python API to transcribe audio files with the zipformer model. However, I'm encountering an error indicating a dimension mismatch between the input data and…
-
[INFO] Listening to sound from Microphone: #24 - Microphone Array (Technologie Intel® Smart Sound)
[INFO] Listening to sound from Speaker: #22 - Enceintes (2- Realtek(R) Audio) [Loopback]
[INFO] Adj…
-
Please check whether this paper is about 'Voice Conversion' or not.
## article info.
- title: **V-Cloak: Intelligibility-, Naturalness- & Timbre-Preserving Real-Time Voice Anonymization**
- summary:…