eellak / gsoc2018-spacy

[GSOC] Greek language support for spacy.io python NLP software
http://nlpbuddy.io/gsoc
MIT License
97 stars 9 forks source link

Inconsistent Dataset/Jsonl file #9

Open thak123 opened 4 years ago

thak123 commented 4 years ago

The dataset provided in the jsonl format has repeating values as labels for the same given spans.

This when loaded into spacy throws error as spacy doesnt support tagging same span with multiple entities.