An error occurred when use pretrained model to train WenetSpeech dataset

yeyupiaoling / PPASR

基于PaddlePaddle实现端到端中文语音识别，从入门到实战，超简单的入门案例，超实用的企业项目。支持当前最流行的DeepSpeech2、Conformer、Squeezeformer模型

Apache License 2.0

804 stars 128 forks source link

An error occurred when use pretrained model to train WenetSpeech dataset #30

Closed tcexeexe closed 2 years ago

tcexeexe commented 2 years ago

Hello~ l copy the pretrained model form "PPASR_大数据集/models/deepspeech2/best_model" to the "/PPASR/models/deepspeech2/last_model", then l start trainning, but an error occurred. l wonder what's the reason of this problem?

AssertionError: Variable Shape not match, Variable [ linear_0.w_0_moment1_0 ] need tensor with shape (1024, 5451) but load set tensor with shape (1024, 6436)

yeyupiaoling commented 2 years ago

Because dictionaries are different sizes. It is recommended to use all the files in the pre-training dataset.

tcexeexe commented 2 years ago

Because dictionaries are different sizes. It is recommended to use all the files in the pre-training dataset.

好的，谢谢!

tcexeexe commented 2 years ago

Because dictionaries are different sizes. It is recommended to use all the files in the pre-training dataset. 你好，那如果按照你的意思，我要在原有模型上，添加新的词语，做迁移学习的话，是不是就不行了

yeyupiaoling commented 2 years ago

那个大数据的字典基本已经包括常用的汉字了

tcexeexe commented 2 years ago

那个大数据的字典基本已经包括常用的汉字了

好的