@nshmyrev We tested the small Japanese model (downloadable here: https://alphacephei.com/vosk/models/vosk-model-small-ja-0.22.zip) in an Android app.
However, it seems some Japanese words are incorrectly recognized.
If we say "こんにちは" (Konnichiwa), the Vosk SDK recognizes it as "今日は", which means "today is…" (Kyou wa).
"今日は" doesn't mean "こんにちは" ("今日は" can be pronounced "Konnichiwa", but it is usually pronounced "Kyou wa").
We expect the output to be "こんにちは", not "今日は".
The problem looks like an inappropriate hiragana-to-kanji conversion. However, the Vosk model does not output hiragana and then convert it to kanji; it outputs kanji from the start, which is what causes this issue.
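As a stopgap on the app side, a post-processing step along the following lines might hide the symptom, though it does not address the root cause in the model. This is only a sketch under assumptions: `KANA_OVERRIDES` and `normalize_greeting` are hypothetical names, the table holds only the one case from this report, and the input is assumed to be the JSON result string with a `text` field. Python is used for brevity; the same logic would carry over to Java or Kotlin in the Android app.

```python
import json

# Hypothetical correction table: kanji renderings the app wants shown in kana.
# Only this one entry comes from the report; anything else would be app-specific.
KANA_OVERRIDES = {
    "今日は": "こんにちは",
}


def normalize_greeting(result_json: str) -> str:
    """Return the recognized text, rewriting known standalone greetings in kana."""
    text = json.loads(result_json).get("text", "")
    # Assumption: tokens may be separated by spaces, so compare without them.
    compact = text.replace(" ", "")
    # Rewrite only when the whole utterance matches, so a sentence such as
    # "今日は雨です" ("today is rainy") is left untouched.
    return KANA_OVERRIDES.get(compact, text)
```

Matching the whole utterance rather than a substring is deliberate: "今日は" is a correct transcription whenever the speaker really does say "Kyou wa" at the start of a longer sentence.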
There's a previous issue that we think is related: https://github.com/alphacep/vosk-api/issues/1047
Here is a video demonstrating the issue:
https://github.com/alphacep/vosk-api/assets/168809162/3718f5c1-d399-4472-9d45-1e90adcc4049
Could you take a look at this issue when you have some free time? We would really appreciate your help.