TraCES-Lexicon / lexicon

Geez lexicon of the TraCES project
1 stars 0 forks source link

arethusa integration for treebanking annotation #11

Open PietroLiuzzo opened 7 years ago

PietroLiuzzo commented 7 years ago

(elaborated from replay of Bridget Almas to request to use the arethusa tool for ethiopic) http://www.perseids.org/tools/arethusa/app/#/

lexicon service should work as a morphological service that can parse Ethiopic text, to be hooked up in arethousa. You could also use the Universal Dependencies tagsets (which you can experiment with by choosing Advanced Options on new Treebank input form, and selecting UD English)

If you want to make your own tag sets and/or morphological service, there are some resources:

For Morphology: https://github.com/alpheios-project/arethusa/wiki/Creating-a-Morphological-Attribute-Configuration-File https://github.com/alpheios-project/arethusa/wiki/Adding-a-new-Morphology-Service-to-Arethusa

For Relations, Thibault just produced a very handy little tool that creates them from CSV files: http://arethusa-grammar.herokuapp.com/

These can then be uploaded to the Perseids hosted site by forking the https://github.com/alpheios-project/arethusa-configs/ repository, adding them there, and then sending a pull request. You can browse the variety of different tag sets in that repo for examples.

PietroLiuzzo commented 7 years ago

Goal

integrate the lexicon with the arethusa tool to produce treebanks for ethiopic texts

Main actor(s)

users of arethusa and researchers willing to do treebanking of ethiopic texts

Short description

(elaborated from replay of Bridget Almas to request to use the arethusa tool for ethiopic) http://www.perseids.org/tools/arethusa/app/#/

Examples

You can browse the variety of different tag sets in the above mentioned repo for examples.

Preconditions

the lexicon should be able to do #2 and have defined labels for morphology