chewing / libchewing

libchewing - The intelligent phonetic input method library
https://chewing.im/
GNU Lesser General Public License v2.1
366 stars 91 forks source link

chewing-cli 應該也要有注音修正功能 #655

Open kanru opened 1 week ago

kanru commented 1 week ago

Discussed in https://github.com/chewing/libchewing/discussions/654

Originally posted by **llc0930** November 5, 2024 chewing-cli 應該也要有注音修正功能。 一修正為ㄧ;丫修正為ㄚ。 尤其是前者...教育部的xls跟xlsx真是個噩夢 ![2024-11-05-10:53:20-r](https://github.com/user-attachments/assets/5aca7c96-cca0-4d79-8c8f-d1458a09deef) chewing-cli 生成詞庫時顯然會去掉重複字詞,所以沒有做去重... [《成語典》dict_idioms_2020_20240926.txt](https://github.com/user-attachments/files/17626371/dict_idioms_2020_20240926.txt) `chewing-cli init-database -n "《成語典》" -c "中華民國教育部" -l "CC BY-ND 3.0 臺灣" -r "2020_20240926" ./《成語典》dict_idioms_2020_20240926.txt dict_idioms_20240926.dat` ``` == Trie Dictionary Statistics == Node count : 19226 Leaf count : 5186 Phrase count : 5456 Max height : 9 Average height : 1 Root branch count : 705 Max branch count : 80 Average branch count : 0 ```
kanru commented 1 week ago

其他的簡單修正 https://github.com/chewing/libchewing/discussions/656#discussion-7450782