-
I have trained the PASE+ encoder on my custom dataset, but I couldn't find anything about data preparation for the ASR experiment on a custom dataset. I have a set of .wav files and their corresponding text t…
-
### env:
#### OS:
- Windows 10
#### Python:
- 3.8.10
#### Requirements:
- paddlepaddle==2.4.2
- paddlenlp==2.5.2
- paddlehub==2.3.1
- paddlespeech==1.4.1
### File Structure:
![image…
-
Are there any plans to add Vosk as an ASR engine?
https://github.com/alphacep/vosk-api
-
Evening,
I am having issues with my microphone (nothing to do with Rhasspy at all), which resulted in the following timeout when I used the wake button:
![image](https://user-images.githubuserc…
-
`Traceback (most recent call last):
File "/home/users/sagar/nvidia_deepspeech2/lib/python3.7/site-packages/tensorflow/python/client/session.py", line 1334, in _do_call
return fn(*args)
File…
-
The following sequence of warnings is repeated many times when I build the documentation:
```
If you want to connect to this kernel from an other client, you can use the /private/var/folders/cp/df3…
-
```fortran
program main
integer, parameter :: x1(2, 2) = reshape([1, 2, 3, 4], [2, 2])
print *, x1
end program
```
```console
(base) saurabh-kumar@Awadh:~/Projects/System/lfortran$ gfortr…
-
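1. The most important feature for meeting minutes is speaker identification. Ideally, when A speaks it shows A: "12345", then B: "blah blah", and when A speaks again it recognizes that A said: "blah blah". Can something like this be implemented by adding a model?
2. Add a one-click summary feature: after I press the stop button, a summary button pops up that can call a large AI model to summarize the recognized content, for example by connecting to GPT or FastGPT. Since I have an offline, locally deployed knowledge base, I would like to have a feature like this. Thank you very much.
3. Please add a simple ASR for converting recordings to text: drag and drop into the dialog box…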
-
# Authenticate to Azure
Connect-AzAccount
# Define your subscription ID, resource group name, and ASR vault name
$subscriptionId = "your-subscription-id"
$resourceGroupName = "your-resource-group-nam…
-
### 🐛 Describe the bug
On H100 and A100 instances, setting PYTORCH_CUDA_ALLOC_CONF='expandable_segments:True' is not taking effect with the latest nightlies.
```
[W CUDAAllocatorConfig.h:28]…