-
It might be possible to add in data from the Unihand database too.
Character search interface: https://unicode.org/charts/unihan.html
Unihan download here: https://www.unicode.org/Public/UCD/lates…
-
The module is very convenient, but it stays at Unicode 5.1.0, any plan to upgrade to latest Unicode standard? or any recommended module for latest Unihan database?
Thanks!
-
Hi,
I have found an instance of bad data in the database. I guess there could be more. Should the UniHan data be automatically cleaned before importing?
```
from cihai.core import Cihai
from c…
-
I feel it is useful and feasible to have a font for Japanese ruby with the same GSUB mechanism. As a lot of the tooling would be similar, this could preferably be incorporated in this codebase. I am u…
-
Currently we include the value of the unihan kFrequency field for a character in the popup output.
According to the unihan database, this is: "A rough frequency measurement for the character based …
-
Currently we display 4 possible pronunciation fields:
pinyin (from CEDICT)
mandarin (from Unihan -- which is almost always the same as pinyin but it seems not always)
cantonese (from Unihan)
tan…
-
```
目前碼表也基本夠用(除了地球拼音略小些),但 rimeime
是精益求精的輸入法,可以更好。
這個壓縮包裏: http://www.unicode.org/Public/UNIDATA/Unihan.zip 有個
Unihan_reading.txt
。把粵唐日韓越去掉後,可得到四萬多字的漢語拼音,帶聲��
�。
中研院漢字構形資料庫:
http://cdp.sinica.ed…
-
```
目前碼表也基本夠用(除了地球拼音略小些),但 rimeime
是精益求精的輸入法,可以更好。
這個壓縮包裏: http://www.unicode.org/Public/UNIDATA/Unihan.zip 有個
Unihan_reading.txt
。把粵唐日韓越去掉後,可得到四萬多字的漢語拼音,帶聲��
�。
中研院漢字構形資料庫:
http://cdp.sinica.ed…
-
```
目前碼表也基本夠用(除了地球拼音略小些),但 rimeime
是精益求精的輸入法,可以更好。
這個壓縮包裏: http://www.unicode.org/Public/UNIDATA/Unihan.zip 有個
Unihan_reading.txt
。把粵唐日韓越去掉後,可得到四萬多字的漢語拼音,帶聲��
�。
中研院漢字構形資料庫:
http://cdp.sinica.ed…
-
CEDICT can contain several entries for the same characters, and there is no indication as to which meanings / readings are more frequent. The Unihan database organises readings by frequency, so we can…