Exported annotated data in CoNLL 2003 NER format cannot be imported in SpaCy.
SpaCy expects documents to be separated using -DOCSTART- -X- O O line and sentences with whitespaces as per its documentation for converting CoNLL-2003 NER format to json.
https://spacy.io/api/cli#convert
Should this be handled in the converter? If yes, I can push a PR to fix it.
Exported annotated data in CoNLL 2003 NER format cannot be imported in SpaCy. SpaCy expects documents to be separated using
-DOCSTART- -X- O O
line and sentences with whitespaces as per its documentation for converting CoNLL-2003 NER format to json. https://spacy.io/api/cli#convertShould this be handled in the converter? If yes, I can push a PR to fix it.