EmilStenstrom / conllu

A CoNLL-U parser that takes a CoNLL-U formatted string and turns it into a nested Python dictionary.
MIT License

Consider supporting CoNLL-U Plus #33

Closed: EmilStenstrom closed this issue 5 years ago

EmilStenstrom commented 5 years ago

More info available here: https://universaldependencies.org/ext-format.html

EmilStenstrom commented 5 years ago

The CoNLL-U Plus format has column definitions as metadata:

# global.columns = ID FORM LEMMA UPOS XPOS FEATS HEAD DEPREL DEPS MISC

Could we use these fields to parse non-standard sentences without having to specify any fields when parsing?
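
A minimal sketch of that idea, written in plain Python rather than against this library's API. The function names `read_global_columns` and `parse_tokens` are hypothetical, the three-column sample is made up for illustration, and lowercasing the column names is an assumption about how the fields might be keyed:

```python
def read_global_columns(text):
    """Return the field names from a '# global.columns = ...' line, or None."""
    for line in text.splitlines():
        line = line.strip()
        if line.startswith("#"):
            # Split '# key = value' into its key and value parts.
            key, sep, value = line[1:].partition("=")
            if sep and key.strip() == "global.columns":
                # Assumption: lowercase the names so they can serve as dict keys.
                return [name.lower() for name in value.split()]
    return None


def parse_tokens(text, default_fields=None):
    """Parse tab-separated token lines, using column names found in the metadata."""
    fields = read_global_columns(text) or default_fields
    if fields is None:
        raise ValueError("no column definition found and no default fields given")
    tokens = []
    for line in text.splitlines():
        # Skip blank lines (sentence separators) and comment/metadata lines.
        if not line.strip() or line.startswith("#"):
            continue
        values = line.split("\t")
        tokens.append(dict(zip(fields, values)))
    return tokens


sample = (
    "# global.columns = ID FORM UPOS\n"
    "1\tThe\tDET\n"
    "2\tdog\tNOUN\n"
)
tokens = parse_tokens(sample)
```

With a sketch like this, the caller would only need to pass fields explicitly when the file has no `global.columns` line.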

EmilStenstrom commented 5 years ago

Fixed in 1f460a8