-
from funasr import AutoModel
model_path = r'E:\GC\speech_paraformer-large_asr_nat-zh-cn-16k-common-vocab8404-pytorch'
vad_path = r'E:\GC\speech_fsmn_vad_zh-cn-16k-common-pytorch'
punc_path = r'…
-
ubuntu
git clone https://github.com/alibaba-damo-academy/FunASR
```python
from funasr import AutoModel
# paraformer-zh is a multi-functional asr model
# use vad, punc, spk or not as you need
m…
-
我看3d-speaker已经支持C++了
![image](https://github.com/alibaba-damo-academy/FunASR/assets/12029452/b6d335c3-7c6e-42a4-a4cb-85ce9792176b)
请要funasr要怎么才能使用说话人日志模型,没找到有对应参数传入这个模型。
或者结合这个模型使用[iic/speech_campp…
-
Notice: In order to resolve issues more efficiently, please raise issue following the template.
(注意:为了更加高效率解决您遇到的问题,请按照模板提问,补充细节)
## 🐛 Bug
使用1.x版本funasr,跑aishell训练例子时,在stage 1 compute_audio_cmv…
-
## ❓ VAD模型添加了max_end_silence_time参数无效果
### Before asking:
1. issues无相关问题
2. doc里有提这个参数,但是没有使用示例
#### 我使用示例中的代码对一个音频做VAD分割,很多结果达到了最新限制60秒,添加max_end_silence_time参数对结果没有影响,从500改到1500,输出结果一样…
-
![image](https://github.com/alibaba-damo-academy/FunASR/assets/12029452/8c5a7c0f-707c-45f2-a0f4-4844ec4a2976)
我是通过dokcer镜像版本funasr-runtime-sdk-cpu-0.4.4 (2dc87b86dc49)进行部署的,但是发现还不支持说话人日志功能。
希望runt…
-
**Describe the feature**
Features description
**Motivation**
A clear and concise description of the motivation of the feature. Ex1. It is inconvenient when [....]. Ex2. There is a recent paper [.…
-
**General Question**
需求: 调用说话人识别模型进行大规模数据推理;
使用模型:
task='speaker-diarization',
model='damo/speech_campplus_speaker-diarization_common',
m…
-
服务部署目前不支持 说话人识别模型
_Originally posted by @lyblsgo in https://github.com/modelscope/FunASR/issues/1780#issuecomment-2146419327_
-
一段电视剧声音,含背景音,几乎区分不出4个说话人,全部是speaker 0