sschmidTU / mr-kanji-search-wtk

WTK-Search is a Kanji search engine using (multiple) Wanikani radicals or RTK names, on a RTK element dataset of 3000+ Kanji
https://sschmidtu.github.io/mr-kanji-search-wtk/
7 stars 2 forks source link

Annotate remaining RTK3 Kanji #1

Closed sschmidTU closed 3 years ago

sschmidTU commented 3 years ago

Of the ~3030 RTK Kanji included in the original data, only ~2220 were originally annotated with RTK elements (counterpart to WK 'radicals'). Currently, ~2600 Kanji are annotated (all until 2604.md).

The goal is to annotate all 3030 RTK Kanji contained in the data, then add new kanji and annotate them as well.

The non-annotated kanji are missing the elements entry in the .md files. This regex over the project (.md files) can currently pretty reliable find non-annotated kanji: keyword:.*\nstrokes This stops working when elementsWK are added however, then we need another regex.

I'm continuously annotating them (starting at 2043.md), will just take some time.

sschmidTU commented 3 years ago

I now have the rtk3 book, which will make annotation of the rtk3 kanji which don't yet have elements much easier.

Yet, rtk3 doesn't split the keywords up into its primitive components, so these should be added. (for 倭 Yamato rtk3 only gives person committee, but we should add person, committee, wheat, woman to elements)

sschmidTU commented 3 years ago

Finished checking rtk3 v4 Kanji 2043-2106, from 2107 now on most kanji will not yet be annotated (have elements) at all.

sschmidTU commented 3 years ago

Annotated up until RTK3 #2500.

sschmidTU commented 3 years ago

Annotated up until RTK3 #2604.

sschmidTU commented 3 years ago

Done, all RTK kanji (that i know of) are indexed and searchable by elements now. (starting in version 1.1.2.0)