nicolas-raoul / kakasi-java

Kanji transliteration to hiragana/katakana/romaji, in Java
GNU General Public License v2.0
54 stars 19 forks source link

Japanese Kakasi Dictionary Has Some Weird Letters #1

Open LeesTimeDotCom opened 11 years ago

LeesTimeDotCom commented 11 years ago

Can you check your dictionary: https://raw.github.com/nicolas-raoul/kakasi-java/master/dict/kakasidict

and this post: http://stackoverflow.com/questions/12786355/japanese-kakasi-dictionary-has-some-weird-letters

LeesTimeDotCom commented 11 years ago

From the post Nicolas just replied. It's kind of Japanese okurigana.

But I still have some questions.

In this source code file: https://github.com/nicolas-raoul/kakasi-java/blob/master/src/com/kawao/kakasi/KanjiYomi.java

        okuriganaTable.put(new Character('\u3041'), "aiueow");
        okuriganaTable.put(new Character('\u3042'), "aiueow");
        okuriganaTable.put(new Character('\u3043'), "aiueow");
        okuriganaTable.put(new Character('\u3044'), "aiueow");
        okuriganaTable.put(new Character('\u3045'), "aiueow");
        okuriganaTable.put(new Character('\u3046'), "aiueow");
        okuriganaTable.put(new Character('\u3047'), "aiueow");
        okuriganaTable.put(new Character('\u3048'), "aiueow");
        okuriganaTable.put(new Character('\u3049'), "aiueow");
        okuriganaTable.put(new Character('\u304a'), "aiueow");

Why for each basic vowel hiragana: it has 6 ("aiueow") okurigana letters?

and also for:

        okuriganaTable.put(new Character('\u308e'), "wiueo");
        okuriganaTable.put(new Character('\u308f'), "wiueo");
        okuriganaTable.put(new Character('\u3090'), "wiueo");
        okuriganaTable.put(new Character('\u3091'), "wiueo");

and

okuriganaTable.put(new Character('\u3063'), "aiueokstchgzjfdbpw");