Open AngledLuffa opened 1 month ago
Can I bump this? It's in the way of using that script to measure the quality of new models built from the latest UD release.
Hmm. This is a copy of the evaluation script that was used in the UD parsing shared tasks. I have not investigated the details but I suspect it was never applied to data containing empty nodes. Even in the Enhanced UD parsing shared tasks, paths with empty nodes were first collapsed and empty nodes removed, then this script was applied.
This is not to say the script should not be updated to digest any valid UD data. It definitely should.
I've found it works fine when used on datasets with empty nodes. This one is unique in that the empty node occurs in the middle of an MWT.
Do you need me to update it? I'd honestly prefer it if someone else took it on, but either way
I got the following error when using the version of
eval.py
that ships with UD 2.14 when trying to readar_padt.dev.gold.conllu
by callingload_conllu_file
directly:(The path is weird, but that's because we copy it into our codebase and then import it. Incidentally, releasing it as a Python package would be very helpful.)
The issue is that there's a sentence with an empty node in the middle of an MWT: