Open martinpopel opened 2 years ago
Thanks for reporting! That's odd... The GUM build bot validator should have caught these. @yilunzhu - can you take a break from the tokenizer module and debug how these cases got past the validator? The warning should have been triggered during building here:
https://github.com/amir-zeldes/gum/blob/dev/_build/utils/validate.py#L614
I thought each entity should have the same `etype` in all its mentions. However, when loading the newest data from the dev branch into Udapi (`udapy corefud.Load < en_gum-ud-train.conllu`), I get the following warnings: