openboard-team / openboard

GNU General Public License v3.0

toki pona language #822

Open nehemiagurl opened 1 year ago

nehemiagurl commented 1 year ago

(sorry for not doing this via a PR, I'm just not that good at git) toki pona is a 21st-century minimalist constructed language. it can be written with the Latin character set and has a very small vocabulary: the dictionary below has 257 words, many of them neologisms (nimi sin) that are not widely used (don't worry, their frequencies are adjusted down accordingly). the vocabulary is taken from ilo Linku, a database of all toki pona words in use, which is updated yearly based on a community-wide survey. ilo Linku data is licensed under a combined CC BY-SA 3.0 and 4.0 license (proof 1, proof 2), so there should be no problem including it.

the zipped folder contains the .dict file and the .combined.gz file, both created by me.

toki pona.zip

nehemiagurl commented 1 year ago

I've learned that the file having version=1 may cause issues, so I replaced it with version=18 (all other data kept the same).

toki pona.zip
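For anyone who wants to reproduce the version change without editing the archive by hand, here is a minimal sketch. It assumes the .combined.gz wordlist starts with a single header line of comma-separated `key=value` fields that includes `version=`. The header string in the usage note, the file names, and the helper names `bump_header_version` and `rewrite_combined_gz` are all hypothetical, not taken from the attached zip. The compiled .dict file is a separate binary and is not handled here.

```python
import gzip
import re


def bump_header_version(header: str, new_version: int) -> str:
    """Return the header line with its version field set to new_version.

    Assumes the header is a comma-separated list of key=value fields.
    Only the version field is changed; every other field is kept as-is.
    """
    updated, count = re.subn(
        r"(^|,)version=\d+",
        rf"\g<1>version={new_version}",
        header,
    )
    if count != 1:
        raise ValueError("expected exactly one version= field in the header")
    return updated


def rewrite_combined_gz(src_path: str, dst_path: str, new_version: int) -> None:
    """Copy a gzipped wordlist, changing only the version in its first line."""
    with gzip.open(src_path, "rt", encoding="utf-8") as src:
        lines = src.readlines()
    if not lines:
        raise ValueError("wordlist is empty")
    # The first line is assumed to be the header; word entries follow it.
    lines[0] = bump_header_version(lines[0].rstrip("\n"), new_version) + "\n"
    with gzip.open(dst_path, "wt", encoding="utf-8") as dst:
        dst.writelines(lines)
```

Usage would look like `rewrite_combined_gz("old.combined.gz", "new.combined.gz", 18)`, where both paths are placeholders. Writing to a new file rather than in place keeps the original around in case the header turns out to have a different layout.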