有没有比较可用的语音识别模型 - Githubissues

yeyupiaoling / MASR

Pytorch实现的流式与非流式的自动语音识别框架，同时兼容在线和离线识别，目前支持Conformer、Squeezeformer、DeepSpeech2模型，支持多种数据增强方法。

Apache License 2.0

572 stars 100 forks source link

有没有比较可用的语音识别模型 #9

Closed shenyingying closed 3 years ago

shenyingying commented 3 years ago

如题

yeyupiaoling commented 3 years ago

@shenyingying 文档最后

shenyingying commented 3 years ago

@shenyingying 文档最后

不好意思，我可能没有表述明白自己的意思，我是说的除了这个网络，还有没有其他别的语音识别这块的开源工程，因为我只搜到masr和百度的deepspeech？

yeyupiaoling commented 3 years ago

@shenyingying 我目前也只是用着两个模型，其中MASR只能用于学习，如果要应用，最好用DeepSpeech：https://github.com/yeyupiaoling/PaddlePaddle-DeepSpeech

shenyingying commented 3 years ago

@shenyingying 我目前也只是用着两个模型，其中MASR只能用于学习，如果要应用，最好用DeepSpeech：https://github.com/yeyupiaoling/PaddlePaddle-DeepSpeech

但我尝试deepSpeech 好像效果不怎么好，没有masr 效果好

yeyupiaoling commented 3 years ago

@shenyingying 你用我的模型咯：https://github.com/yeyupiaoling/PaddlePaddle-DeepSpeech 同时我也提供了预训练模型，你试一下，但我只训练了十几轮，你先用着试试

shenyingying commented 3 years ago

@shenyingying 你用我的模型咯：https://github.com/yeyupiaoling/PaddlePaddle-DeepSpeech 同时我也提供了预训练模型，你试一下，但我只训练了十几轮，你先用着试试

MRSR 中单独用自己数据出了三天nan也没关系吗？

yeyupiaoling commented 3 years ago

@shenyingying 出现nan不行的，我说的是出inf还可以回来。看这个回答：https://github.com/yeyupiaoling/MASR/issues/6#issuecomment-706454651

yeyupiaoling commented 3 years ago

@shenyingying 还有疑问吗？没有我就关闭issue了