wenet-e2e / WenetSpeech

A 10000+ hours dataset for Chinese speech recognition
Apache License 2.0
506 stars 49 forks source link

从modelscope下载数据集脚本需要更新 #57

Open littlebowlnju opened 2 months ago

littlebowlnju commented 2 months ago

由于ModelScope版本经过了多次更新,按照当前项目README中所指示的命令运行会遇到诸多错误。 部分问题可能是由于ModelScope版本更新后的代码改动造成,可通过降低modelscope版本解决,然而README中并没有指定modelscope下载的版本,默认下载了最新版。部分问题则是脚本本身就应该针对modelscope下载做出的修改。

遇到的问题及解决方式如下:https://blog.csdn.net/granday/article/details/141605664?csdn_share_tail=%7B%22type%22%3A%22blog%22%2C%22rType%22%3A%22article%22%2C%22rId%22%3A%22141605664%22%2C%22source%22%3A%22granday%22%7D

xjtupanda commented 2 months ago

This configuration worked fine for me:

pip install datasets==2.18.0 modelscope==1.14.0

I used Python of version 3.9.19