vanatteveldt / alpino-server

Simple web interface for alpino
MIT License
2 stars 1 forks source link

Documentation | Help understanding NAF XML file #1

Closed jochemstoel closed 5 years ago

jochemstoel commented 5 years ago

Hi, I am really glad I found this Alpino server! Where can I find some docs about the NAF XML file format? I recognize some abbreviations instinctively but it is unclear to me how to parse the persoonsvorm, onderwerp etc from the file and determine exactly the relationships between words . I will use JavaScript to parse the output from the server "De banaan is zeer optimistisch" and want to be able to determine that (zeer) optimistisch is a property of banaan and zeer is a bijwoord describing optimistisch.

Or am I expecting too much?

vanatteveldt commented 5 years ago

Thanks for your interest. That is absolutely something you can get out of the parse tree.

The NAF documentation is available e.g. here: https://research.vu.nl/en/publications/naf-the-nlp-annotation-framework.

You probably want to get mostly the part of speech (from the term layer) and the dependencies.

(cc @antske because she knows more about the NAF side)

jochemstoel commented 5 years ago

Thanks. I might be completely blind but where on that page is the content? I was expecting something like a reference or PDF file to read.

vanatteveldt commented 5 years ago

Sorry, my bad, I thought it would link to the report directly.

Actually, maybe their github is simply the best source: https://github.com/newsreader/NAF