sorlok / waitzar

Automatically exported from code.google.com/p/waitzar

Add support for Shan. #84

Closed GoogleCodeExporter closed 9 years ago

GoogleCodeExporter commented 9 years ago
This will first require some overhaul to the key-based input in WaitZar.
(At the moment, it is only enabled for the help keyboard).

Here are some Shan resources, to get started:
http://sealang.net/shan/dictionary.htm
http://myanmaritpros.com/forum/topic/show?id=1445004%3ATopic%3A115333
http://saiddzone.googlepages.com/yunghkio

We will need a Shan corpus to mine trends from, just like we did with
standard Burmese. Setting this for long-term.

Original issue reported on code.google.com by seth.h...@gmail.com on 2 Jul 2009 at 4:23

GoogleCodeExporter commented 9 years ago
Hi Seth

I'm not sure that attachment is useful for building a word list.
You may see some syllable patterns in the last paragraph.

I had a Shan dictionary in ASCII format too. I will ask the original author and
reply back to you with her decision.

Ngwe Tun

Original comment by ngwes...@gmail.com on 3 Jul 2009 at 7:47

Attachments:

GoogleCodeExporter commented 9 years ago
Thank you, that would be very helpful. The process is quite slow, but we will
get there eventually. (The original version of WaitZar took over 1.5 years to
research.) We want to make sure we do it right the first time. ;)
-->Seth

Original comment by seth.h...@gmail.com on 3 Jul 2009 at 9:44

GoogleCodeExporter commented 9 years ago
The following web site has lots of Shan text (from Minn Kyaw)
http://www.mongloi.org/

Original comment by seth.h...@gmail.com on 27 Nov 2009 at 12:26

GoogleCodeExporter commented 9 years ago
Simple demo; seems to work in principle. (Need testers, now...)

Original comment by seth.h...@gmail.com on 21 Jun 2010 at 10:37

Attachments:

GoogleCodeExporter commented 9 years ago
Ko Thar Thar has suggested someone to talk to. Hope to get a new tester...

Original comment by seth.h...@gmail.com on 20 Sep 2010 at 10:47

GoogleCodeExporter commented 9 years ago
New tester added to the developers team.

Original comment by seth.h...@gmail.com on 27 Sep 2010 at 3:34

GoogleCodeExporter commented 9 years ago
New issues:
  U+1031 (Myanmar vowel sign E) and U+1084 (Myanmar vowel sign Shan E) should be typed before consonants, and re-ordered AFTER them.

Currently, we are required to type them after.

Example:
ေတႃႇမီႇႄလႈႁဝ်းေတမႃးလဵၼ်ႈ
...currently has to be typed as:
တေႃႇမီႇလႄႈႁဝ်းတေမႃးလဵၼ်ႈ
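
The re-ordering asked for above can be sketched roughly as follows. This is only an illustration, not WaitZar's actual code: the function names are hypothetical, and the consonant test is a simplifying assumption that covers just the basic Myanmar letters (U+1000 to U+1021) and the Shan letters (U+1075 to U+1081).

```python
# Vowel signs that are typed BEFORE the consonant but stored AFTER it.
PREFIX_VOWELS = {"\u1031", "\u1084"}


def is_consonant(ch):
    """Rough consonant test (assumption: basic Myanmar and Shan letters only)."""
    cp = ord(ch)
    return 0x1000 <= cp <= 0x1021 or 0x1075 <= cp <= 0x1081


def reorder_typed(typed):
    """Move each prefix vowel to just after the consonant typed right after it."""
    out = []
    pending = None  # a prefix vowel waiting for its consonant
    for ch in typed:
        if pending is not None:
            if is_consonant(ch):
                # Consonant arrived: emit it first, then the held vowel.
                out.append(ch)
                out.append(pending)
                pending = None
                continue
            # No consonant followed; emit the held vowel unchanged.
            out.append(pending)
            pending = None
        if ch in PREFIX_VOWELS:
            pending = ch
        else:
            out.append(ch)
    if pending is not None:
        out.append(pending)
    return "".join(out)
```

With this sketch, typing U+1031 followed by U+1010 yields U+1010 followed by U+1031, and text that already has the vowel after the consonant passes through unchanged.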

Original comment by seth.h...@gmail.com on 27 Sep 2010 at 4:52

GoogleCodeExporter commented 9 years ago
Fixed. Will update a new demo to the Shan testers soon.
Note: I made U+103C (ya-yit) typable "before" the consonant, like U+1031 and U+1084. This seemed
correct, but I'll have to check with the testers.

Original comment by seth.h...@gmail.com on 27 Sep 2010 at 8:28

GoogleCodeExporter commented 9 years ago
Ko Sai Di has said that WZ works correctly now. Closing this bug; anything new 
will be a bug report (not a feature request).

Original comment by seth.h...@gmail.com on 30 Sep 2010 at 8:28