Open GoogleCodeExporter opened 9 years ago
Wrote a groovy script to extract all roots (except roots with less than three
letters because they are most likely functional words, and roots with "&" (the
lateral fricative) because InukMagazine has different writing conventions for
Romanized Inuktitut) and to write a jape grammar. The source (list of roots) is
taken from Inuktitut Computing, converted from html to txt.
http://www.inuktitutcomputing.ca/DataBase/en/index.html
The jape grammar contains two types of rules: 1) extract words with roots; 2)
extract words with a common root.
Run in GATE, found ~30% of words per paragraph. Colour coded only
LexicographyKnown (Rule type 1), not by each root.
Please investigate why colour codes only LexicographyKnown.
Original comment by hisako...@gmail.com
on 9 Nov 2011 at 11:22
Original comment by a...@ilanguage.ca
on 10 Nov 2011 at 12:35
Original comment by a...@ilanguage.ca
on 10 Nov 2011 at 12:36
Original comment by a...@ilanguage.ca
on 25 Nov 2011 at 10:26
Original issue reported on code.google.com by
hisako...@gmail.com
on 9 Nov 2011 at 11:02