openvanilla / McBopomofo

小麥注音輸入法
http://mcbopomofo.openvanilla.org/
MIT License

Add missing character 萜 #496

Closed xatier closed 1 month ago

xatier commented 2 months ago

Ref:

https://en.wiktionary.org/wiki/%E8%90%9C
https://www.wikiwand.com/zh-tw/%E8%90%9C%E7%83%AF
https://www.wikiwand.com/zh-tw/%E7%B1%BB%E8%90%9C

xatier commented 2 months ago

Terpenes and terpenoids 💨

ChiahongHong commented 2 months ago

Great! Although it won't affect the actual outcome, should we change the encoding of this character from big5 to utf8 to maintain encoding correctness before removing the encoding field as discussed in https://github.com/openvanilla/McBopomofo/pull/491#issuecomment-2161376260?
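For readers unfamiliar with the encoding question, here is a minimal Python sketch of one way to check whether a character is representable in Big5. It assumes that the `big5`/`utf8` values in the encoding field indicate Big5 representability; that reading of the field is an assumption, and the helper name `encoding_label` is hypothetical rather than part of McBopomofo. Python's built-in `big5` codec may also differ from whatever table the project's data was originally checked against.

```python
def encoding_label(ch: str) -> str:
    """Return 'big5' if ch can be encoded with Python's big5 codec, else 'utf8'.

    Hypothetical helper for illustration only; not part of McBopomofo.
    """
    try:
        ch.encode("big5")
    except UnicodeEncodeError:
        # The character has no mapping in Python's big5 codec.
        return "utf8"
    return "big5"


# Print the label for a common character, an emoji, and the character in this PR.
for ch in ("中", "💨", "萜"):
    print(ch, encoding_label(ch))
```

Running this locally shows which label Python's codec would assign to 萜; the result should be treated as a hint, not as the project's definition of the field.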

xatier commented 2 months ago

Sure thing, fixed! Thanks for the advice!

tianjianjiang commented 2 months ago

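It may not be written this way very often, but I'm a bit curious whether 「萜烯烴」 will be converted correctly, or whether it will come out as 「萜烯聽」 or some other combination.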

ChiahongHong commented 1 month ago

@tianjianjiang

I think the outcome for this special case is inherently difficult to predict, so it should be fine.

If necessary, we can also add 萜烯烴 to the list.

https://github.com/openvanilla/McBopomofo/assets/36815907/1cf8c21e-df83-4c1c-b804-be58464b623b

tianjianjiang commented 1 month ago

@ChiahongHong Thank you for the usage video, that's really helpful.

I anticipated that difficulty as well. As long as the current behavior is acceptable to you (and hopefully the online learning algorithm will kick in soon enough), I have no strong preference either way on adding 萜烯烴.