yeyupiaoling / Whisper-Finetune

Fine-tune the Whisper speech recognition model, with support for training without timestamp data, training with timestamp data, and training without speech data. Also supports accelerated inference and deployment on the Web, Windows desktop, and Android.
Apache License 2.0
866 stars 143 forks

What problems arise if some words in the dataset are not in the base model's vocabulary? #89

Closed ILG2021 closed 1 month ago

ILG2021 commented 2 months ago

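1. If the word embeddings are not fine-tuned, will that affect the results?
2. If the word embeddings are fine-tuned, roughly how long does training take?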

yeyupiaoling commented 1 month ago

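@ILG2021 I haven't tried fine-tuning the vocabulary. I don't recommend modifying the vocabulary, since it may degrade the model's performance.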
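
For context on the original question: Whisper's tokenizer is a byte-level BPE, so a word that is not a single vocabulary entry is generally not rejected but split into smaller pieces, down to raw bytes if needed. The toy sketch below only illustrates that idea. It is not Whisper's tokenizer: `segment` and `TOY_VOCAB` are hypothetical names, and it uses greedy longest-match rather than real BPE merge rules.

```python
def segment(word, vocab):
    """Greedy longest-match split of `word` into vocab pieces, with byte fallback.

    A simplified stand-in for byte-level BPE (assumption: real BPE applies
    learned merge rules instead of longest-match).
    """
    pieces = []
    i = 0
    while i < len(word):
        # Try the longest substring starting at position i that is in the vocabulary.
        for j in range(len(word), i, -1):
            if word[i:j] in vocab:
                pieces.append(word[i:j])
                i = j
                break
        else:
            # Nothing matched: fall back to the UTF-8 bytes of this one character.
            pieces.extend(f"<0x{b:02X}>" for b in word[i].encode("utf-8"))
            i += 1
    return pieces


# Hypothetical toy vocabulary, far smaller than any real one.
TOY_VOCAB = {"fine", "tune", "whisper", "tok", "en", "s"}

in_vocab = segment("whisper", TOY_VOCAB)     # a single piece
split_word = segment("finetune", TOY_VOCAB)  # two known pieces
unseen = segment("tokens词", TOY_VOCAB)       # known pieces plus byte tokens

print(in_vocab)
print(split_word)
print(unseen)
```

In this sketch an unseen word still gets encoded, just as a longer sequence of pieces, which is one reason leaving the vocabulary unchanged can be workable.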