enabling-languages / myanmarweb

Tools and resources for Myanmar Web development
MIT License

Refactor and rewrite all regexes for syllable break identification #7

Closed andjc closed 9 years ago

andjc commented 9 years ago

Refactor and rewrite all regexes for syllable break identification

The syllable break regexes for Myanmar (Burmese) and Sgaw Karen currently have bugs in them. It would be best to refactor them with improved regular expressions.

Two approaches: 1) A regular expression designed for full syllable identification

andjc commented 9 years ago

Replaced issue #6

andjc commented 9 years ago

([^\s`~!@#\$%^&*()_\-=+\[{\]}|;:'",<.>/\?][\u1000-\u1021\u1023-\u1027\u1029\u102A])
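A minimal sketch of how a pattern like this might be used to find candidate syllable breaks, written in Python purely for illustration (the repository's own code may differ). The names `PUNCT`, `LETTERS`, `CANDIDATE`, `candidate_breaks`, and `mark_breaks` are hypothetical. The character class is written with `-`, `[`, and `]` escaped so it parses as a single set, and the preceding character is checked with a lookbehind rather than captured, which is an assumption made here so that adjacent candidates are not skipped.

```python
import re

# Characters that should NOT precede a break: whitespace and ASCII punctuation,
# taken from the negated class in the comment (with -, [ and ] escaped).
PUNCT = r"""\s`~!@#$%^&*()_\-=+\[{\]}|;:'",<.>/?"""

# Myanmar consonants and independent vowels; ranges copied from the comment.
LETTERS = r"\u1000-\u1021\u1023-\u1027\u1029\u102A"

# Zero-width match: previous character is not punctuation/whitespace,
# next character is a Myanmar consonant or independent vowel.
CANDIDATE = re.compile(rf"(?<=[^{PUNCT}])(?=[{LETTERS}])")


def candidate_breaks(text):
    """Return the string indices where a candidate break would be inserted."""
    return [m.start() for m in CANDIDATE.finditer(text)]


def mark_breaks(text, sep="|"):
    """Return text with sep inserted at every candidate break position."""
    pieces = []
    last = 0
    for pos in candidate_breaks(text):
        pieces.append(text[last:pos])
        last = pos
    pieces.append(text[last:])
    return sep.join(pieces)


if __name__ == "__main__":
    word = "\u1019\u103c\u1014\u103a\u1019\u102c"  # "Myanmar" in Burmese script
    print(candidate_breaks(word))
    print(mark_breaks(word))
```

For the sample word this reports candidates at indices 2 and 4, but only the one at index 4 is a real syllable boundary: the consonant at index 2 is followed by asat (U+103A) and closes the first syllable. A pattern of this shape therefore over-segments, which is the kind of bug a full-syllable regular expression would need to address.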