Closed ceramisch closed 5 years ago
This looks like a backward-incompatibility problem in the new library. There is a new more elaborate mechanism for handling annotators and provenance in general. I'll look into it, because the library should retain a good backward compatibility.
I just released FoLiApy v2.1.3 which should fix this problem (please reopen if something still goes wrong).
You may already be aware of this, but just in case: make sure you the latest version of FLAT if you plan on annotating FoLiA v2 documents, older versions won't be able to handle the newer documents.
In your script, you might also want to make use of the new provenance framework, by registering your script as a processor on Document instantiation, like this for example:
doc = folia.Document(id=....., processor=folia.Processor.create("parseme-utilities", version="0.1"))
Any added annotations will then automatically be associated with the processor. It's not mandatory but might be nice to have (other FoLiA tools including FLAT will also add themselves to the provenance chain).
It works now, thanks @proycon
In PARSEME, we have a script developed by @silvioricardoc that converts cupt format (PARSEME's CoNLLU-plus version) to folia format. However, the script is not correctly converting annotator and annotatortype in comments. The problem is in the following method:
where
self._folia_kwargs()
is a dict containing key=value pairs, including annotator and annotatortype:The values seem to be correctly appended to the
folia.Comment
object but when we print the corresponding XML, the information is missing.The problem can be reproduced by dowloading the PARSEME tools:
Information about
annotator
andannotatortype
disappeared in the first conversion from conllup to folia. This didn't happen before...Could it be a bug in the folia library?