huawei-noah / Pretrained-Language-Model

Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.
3.02k stars 628 forks source link

词表 #68

Open oldstree opened 4 years ago

oldstree commented 4 years ago

您好,下载了预训练好的小模型,但是发现有些常见的中文汉字没有再词表中,想问下原因,十分感谢您的分享!

chauncy-cc commented 4 years ago

Hi,github中分享的TinyBert是处理英文数据的模型,无法处理中文。

oldstree commented 4 years ago

好的,感谢!!

chauncy-cc commented 4 years ago

不客气~

oldstree commented 4 years ago

您好,方便分享处理中文数据的模型吗?开源的代码适合训练中文的数据吗?