donkuri / Kaishi

Kaishi 1.5k is a modern, modular Japanese Anki deck made for beginners who want to learn basic vocabulary.
409 stars 7 forks source link

Properly distribute furigana over keywords and example sentences #21

Closed stephenmk closed 6 months ago

stephenmk commented 6 months ago

This deck does not correctly distribute furigana over keywords and sentences. Beginners may not be able to tell that 恐怖 is きょう + ふ rather than きょ + うふ, for example.

I used a furigana solver to fix the distributions.

The data is posted here: https://gist.github.com/stephenmk/3f7fb2d36b5c990298f2319b4f06c8fb

The solver is somewhat opinionated. For example, it doesn't consider 日本 to be separable into に + ほん; it considers にほん to be an indivisible idiomatic reading. If you disagree, you could replace all instances of 日本[にほん] with 日[に] 本[ほん].

BeforeAfter
![kibou1](https://github.com/donkuri/Kaishi/assets/8003332/1aecc3fa-dc47-4edd-b82e-a7022fb5a39b) ![kibou2](https://github.com/donkuri/Kaishi/assets/8003332/906a9694-76b9-4680-a1a1-b48466136009)
![kyoufu1](https://github.com/donkuri/Kaishi/assets/8003332/3174fc9f-e8cb-4154-ab8f-76257ba8adfb) ![kyoufu2](https://github.com/donkuri/Kaishi/assets/8003332/a1a8d773-3120-428b-b9ea-40b96836c053)
![ryoukai1](https://github.com/donkuri/Kaishi/assets/8003332/e6aafa0d-c199-43e6-a05c-0462293bf366) ![ryoukai2](https://github.com/donkuri/Kaishi/assets/8003332/5d34cdff-608e-47fd-bbf4-5f5817830cc5)

I only found a few errors in the deck data. There was the こわばった error that you can see in the example sentence for 恐怖 displayed above. The "Word Furigana" values for 戦い and 入り口 also had errors.

donkuri commented 6 months ago

This is absolutely fantastic! Thank you very much, I have updated the cards accordingly. This is now Kaishi 1.5k v1.3.0.