proycon / foliatools

A number of command-line tools for working with FoLiA (Format for Linguistic Annotation). Includes validators, converters, visualisers, and more.
GNU General Public License v3.0
10 stars 4 forks source link

Conversion failure due to unclear cause #49

Open martinreynaert opened 2 years ago

martinreynaert commented 2 years ago

Hi,

I get this:

(LMdev) reynaert@violet:MARXENGELS$ tei2folia -i -o '-' MarxEngels-A-2003_01-The_Collected_Works_of_Karl_Marx_and_Frederick_Engels_Early_Works_1835_1844_Volume_1-V0.xml Instantiating XML parser Converting MarxEngels-A-2003_01-The_Collected_Works_of_Karl_Marx_and_Frederick_Engels_Early_Works_1835_1844_Volume_1-V0.xml VALIDATION ERROR on full parse by library in MarxEngels-A-2003_01-The_Collected_Works_of_Karl_Marx_and_Frederick_Engels_Early_Works_1835_1844_Volume_1-V0.xml ParseError: FoLiA exception in handling of

@ line None (in parent
@ parent line None) : [ValueError] Unable to add object of type Caption to Division . Type not allowed as child. Unable to convert MarxEngels-A-2003_01-The_Collected_Works_of_Karl_Marx_and_Frederick_Engels_Early_Works_1835_1844_Volume_1-V0.xml

I get a similar failure with another file, there the offending object is of type Table.

In this case, I have no idea what 'type Caption' is. Or how I might avoid that this conversion fails.

Please advise.

I attach the input file:

TEST.tei2folia.zip