modelscope FunASR issues

modelscope / FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

https://www.funasr.com

Other

6.99k stars, 744 forks

issues

IndexError: index 3 is out of bounds for dimension 1 with size 2

#2211 JonneryR opened 2 days ago
1
Intel MKL function load error: mkl_vml_kernel_sLn_ttab.

#2210 1749352011 opened 3 days ago
0
Add bounds check for postprocess_utils.py abbr_dispose()

#2209 BitSteve closed 3 days ago
0
Will the Paraformer-V2 code be open-sourced?

#2208 NiniAndy opened 4 days ago
0
Reading PCM recording files appears to be broken

#2207 wjm030612 opened 5 days ago
1
Fix bug when running paraformer_large_offline on Triton

#2206 yijinsheng closed 5 days ago
1
Fix bug when running paraformer_large_offline on Triton

#2205 yijinsheng closed 6 days ago
0
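CPU offline version: when a client opens a WebSocket connection and pushes several 48k speech WAV files serially, the transcription service restarts with high probability, and log.txt shows no specific error. The generated dmp files (about 5 GB each) make the Docker container keep growing.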

#2204 janchou92 opened 1 week ago
2
Echo problem in FunASR speech recognition?

#2203 JACKYLUO1991 opened 1 week ago
0
Memory leak in the FSMN VAD model (FSMN voice endpoint detection, Chinese, general, 16k)

#2202 tonyzzzzz closed 1 week ago
0
GPU version cannot find the file model_blade.torchscript

#2201 Jotree2012 opened 1 week ago
2
Error running the offline_gpu service with the iic/speech_seaco_paraformer_large_asr_nat-zh-cn-16k-common-vocab8404-pytorch model

#2200 gzqqqqqq opened 1 week ago
2
Error running the offline_gpu service with the damo/speech_paraformer-large-contextual_asr_nat-zh-cn-16k-common-vocab8404 model

#2199 gzqqqqqq opened 1 week ago
0
Does funasr have something like faster_whisper's whisperModel().transcribe that returns segments?

#2198 liuchangzong opened 1 week ago
1
Is audio split in VAD mode deleted automatically after transcription?

#2197 lesrose opened 1 week ago
0
Docker server: sent text is too long and overflows the buffer, so the message fails to send

#2196 psk-github opened 1 week ago
0
In the FunASR open-source demo, runtime/python/websocket/funasr_wss_server.py has no timestamp; where do the timestamps in the online demo come from?

#2195 mrblacklee closed 1 week ago
1
Punctuation restoration model splits words apart when adding English punctuation during inference

#2194 bigcash opened 1 week ago
0
How to use streaming fsmn-kws

#2193 hajkeoadf opened 1 week ago
0
Problem with funasr in language identification

#2192 panxin801 opened 1 week ago
0
Does the Chinese real-time speech dictation service support wake-word models?

#2191 hajkeoadf closed 1 week ago
1
The Quick Start command was interrupted unexpectedly on its first run; since then it keeps reporting modelscope - ERROR - 400 Client Error: Bad Request, and no model seems to download

#2190 HexJay closed 1 week ago
1
RuntimeError: "round_cuda" not implemented for 'Long'

#2189 lukeewin closed 1 week ago
1
Installed the latest version 1.1.6, still getting the error choose a window size 400 that is [2, 160]

#2188 hjj-lmx closed 1 week ago
1
Installed the latest version 1.1.6, still getting the error choose a window size 400 that is [2, 160]

#2187 hjj-lmx closed 1 week ago
0
Fix audio format 2.0

#2186 Djraemon closed 2 weeks ago
0
Paraformer-en model fails during inference: spm index out of range

#2185 MrSupW closed 1 week ago
1
Could you explain how /etc/docker/daemon in Docker works and how to use it?

#2184 tiandiweizun opened 2 weeks ago
0
Integrating modelscope models with Spring Boot

#2183 zhangzhaogit opened 2 weeks ago
1
Installing funasr on Windows 10 Home fails with Failed to build aliyun-python-sdk-core; neither pip install nor installing from source works

#2182 jhchhz closed 1 week ago
2
Fine-tuned model returns empty recognition results for audio

#2181 fjt12138 opened 2 weeks ago
1
C++ WebSocket hangs with multiple threads

#2180 monsterlyg closed 2 weeks ago
2
Why does my recognition result format differ from the online platform's? Any help appreciated

#2179 ordinary-lv closed 1 week ago
1
Error fine-tuning the seaco speech model: 'NoneType' object has no attribute 'contiguous'

#2178 smengfei closed 1 week ago
5
perf(models/FsmnVADStreaming): optimize GetFrameState() and PopDataToOutputBuf()

#2177 truc0 closed 2 weeks ago
0
Error fine-tuning the seaco speech model: 'NoneType' object has no attribute 'contiguous'

#2176 smengfei closed 3 weeks ago
0
Enhanced Evaluation Script with Argument Validation, Structured Error Handling, and Detailed Token-Level Reporting in run_evaluate.py

#2175 vignesh1507 closed 2 weeks ago
0
optimize ComputeDecibel in fsmn-vad model by using numpy

#2174 hongfanmeng closed 3 weeks ago
0
Error deploying the GPU version of the FunASR offline file transcription service

#2173 17806233962 closed 3 weeks ago
0
Problem with long-audio recognition

#2172 jlljill opened 3 weeks ago
0
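Help wanted: we use funasr to receive microphone audio and mediapipe to track lip movement and speaking frequency. The problem is that if the detected person's mouth moves without speaking while someone nearby speaks, the system attributes the speech to the detected person (the face recognized by mediapipe)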

#2171 Redhair957 closed 1 week ago
1
There is no key "file_path_metas" in Qwen-Audio/configuration.json

#2170 JiajunDou opened 3 weeks ago
0
The fsmn-vad model appears to have a CPU performance problem

#2168 hongfanmeng closed 3 weeks ago
5
Add Contribution.md to the main branch.

#2167 vignesh1507 closed 3 weeks ago
0
Can the model be fine-tuned for a classification task, with each utterance's class label as the target?

#2166 chaunceyliu30 closed 1 week ago
1
Error when fine-tuning with finetune.sh from seacoparaformer: do I need to change ["source", "target"]? Should it be switched to the hotword data format?

#2165 YouTwoMeToo closed 3 weeks ago
1
Docker image funasr-runtime-sdk-online-cpu-0.1.11: memory grows with every new connection and does not drop after disconnect

#2164 EAGLE50 closed 2 weeks ago
3
How do I compute the character error rate of a fine-tuned model?

#2163 lukeewin closed 3 weeks ago
1
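Paraformer speech recognition (Chinese, general, 16k, offline, large, long-audio version): after replacing the model files in Docker with the fine-tuned, quantized, exported model, results differ from local validation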

#2162 chiyinbao opened 3 weeks ago
1
Add authentication to the C++ server WebSocket

#2161 cuiyuanzhe opened 3 weeks ago
5