keggsmurph21 / notatrix

[DEPRECATED] An experimental notation system for UD Annotatrix that combines CoNLL-U and CG3
GNU General Public License v3.0

Automatic testing with .conllu files #19

Open ftyers opened 4 years ago

ftyers commented 4 years ago

Around a third of the sentences that I put into annotatrix don't display for some reason. It would be good to set up some kind of automatic testing with notatrix on all of the UD treebanks that pass validation. For example, the report at https://quest.ms.mff.cuni.cz/udvalidator/cgi-bin/unidep/validation-report.pl?UD_Chukchi-HSE shows that Chukchi passes validation, but some of its sentences don't parse with notatrix.
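
A rough sketch of what such a check could look like, in Python rather than notatrix's own code. It splits a `.conllu` file into sentence blocks on blank lines, passes each block to a parser, and records which sentences raise. Everything named here is an assumption for illustration: `split_sentences`, `find_failures`, and the stand-in parser `strict_ten_columns` are hypothetical, and a real run would swap in a wrapper around notatrix's actual parsing entry point.

```python
from typing import Callable, Iterator, List, Tuple


def split_sentences(conllu_text: str) -> Iterator[str]:
    """Yield one sentence block at a time; blocks are separated by blank lines."""
    block: List[str] = []
    for line in conllu_text.splitlines():
        if line.strip():
            block.append(line)
        elif block:
            yield "\n".join(block)
            block = []
    if block:  # the file may not end with a blank line
        yield "\n".join(block)


def find_failures(
    conllu_text: str, parse: Callable[[str], object]
) -> List[Tuple[int, str]]:
    """Run `parse` on every sentence; return (1-based index, error message) pairs."""
    failures: List[Tuple[int, str]] = []
    for index, sentence in enumerate(split_sentences(conllu_text), start=1):
        try:
            parse(sentence)
        except Exception as error:  # any exception counts as "does not parse"
            failures.append((index, str(error)))
    return failures


def strict_ten_columns(sentence: str) -> None:
    """Hypothetical stand-in parser: every non-comment line needs 10 tab-separated fields."""
    for line in sentence.splitlines():
        if line.startswith("#"):
            continue
        if len(line.split("\t")) != 10:
            raise ValueError(f"expected 10 columns: {line!r}")
```

Calling `find_failures(text, strict_ten_columns)` on the contents of one treebank file returns a list of the sentences that failed, so the same loop could be run over every `.conllu` file in the treebanks that pass validation and the failures collected into one report.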

kmurphy4 commented 4 years ago

I totally agree. I'll look into setting this up. There is a test suite for notatrix, but it doesn't have nearly enough data.

Do you know of similar "treebanks" for the other formats (cg3, sdparse, brackets, etc.)?

ftyers commented 4 years ago

Hmm, I don't, but those seem to be much less error-prone (and much less used) than conllu.

keggsmurph21 commented 4 years ago

Ok. I'll look into integrating more conllu tests, though!