拼音到汉字的解码

athena-team / athena-decoder

Apache License 2.0

75 stars 26 forks source link

拼音到汉字的解码 #14

Open liubin3702 opened 4 years ago

liubin3702 commented 4 years ago

作者您好：如果我想在您代码基础上，使用WFST或者Beam-Search完成` 拼音到汉字的解码，我需要做些什么？例如：首先我要获得每个输入拼音状态下对应字典中汉字的一个概率分布（示例中的score.txt中每个状态的batch=3,6,8，11是什么意思）然后我还需要为代码提供哪些东西来完成拼音到汉字的解码呢？望回复

godjealous commented 4 years ago

Firstly, you should have an acoustic model. Actually, the Athena-Team project have already provide completed examples to build an ASR system: from building acoustic model to decode using or not using WFST tecology. Please reference to Athena project in our Athena-Team project or just browse here : https://github.com/athena-team/athena#5-decoding-with-wfst