athena-team / athena-decoder

Apache License 2.0
75 stars 26 forks source link

why the dim is 3651 #6

Open HalFTeen opened 4 years ago

HalFTeen commented 4 years ago

Hi, guys. I am wonder why the dim is 3651 in the file "athena-decoder/examples/hkust/wfst_scores.txt" while the length of vocab is 3650. Looking forward to your reply.

godjealous commented 4 years ago

It is default setting in Athena-team that \<sos> and \<eos> are the same and are always the last label in table.

So the scores of the last dim (3651) is for \<eos>