tecoholic / ner-annotator

Named Entity Recognition (NER) Annotation tool for SpaCy. Generates training data as JSON which can be readily used.
https://tecoholic.github.io/ner-annotator/
MIT License
556 stars, 164 forks

Unable to tag portions of a string #49

kevinelliott closed 2 years ago

kevinelliott commented 2 years ago

Looking to tag portions of a string.

AAABBBCCCDDDEEE.FFFFFF.GGGGGGGG

You cannot tag AAA, BBB, CCC, DDD, EEE, FFFFFF, or GGGGGGGG individually. The tool automatically selects the entire "word". An override to allow for fragmented tagging would be helpful.

tecoholic commented 2 years ago

This is the intended behaviour. The software was created to tag tokens, where tokens are individual "words". To make that easier, selecting even a single letter selects the whole token. Precise selection takes far more effort than rough selection, so this won't be implemented. Sorry.
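
The token-snapping behaviour described above can be sketched roughly as follows. This is only an illustration, not the tool's actual code: the function name `snap_to_token` is hypothetical, and it assumes a token is any run of non-whitespace characters, which may differ from how the annotator really tokenizes.

```python
import re


def snap_to_token(text: str, start: int, end: int) -> tuple[int, int]:
    """Expand the range [start, end) to cover every token it touches.

    Assumption: a token is a maximal run of non-whitespace characters.
    """
    if not 0 <= start < end <= len(text):
        raise ValueError("selection must be a non-empty range inside the text")
    # Collect the spans of all tokens that overlap the selection.
    touched = [
        m.span()
        for m in re.finditer(r"\S+", text)
        if m.start() < end and m.end() > start
    ]
    if not touched:
        # The selection covered only whitespace, so there is nothing to tag.
        raise ValueError("selection does not touch any token")
    # Snap to the start of the first touched token and the end of the last.
    return touched[0][0], touched[-1][1]


text = "id AAABBBCCCDDDEEE.FFFFFF.GGGGGGGG end"
# The user drags over "BBB" only (characters 6 to 8).
start, end = snap_to_token(text, 6, 9)
selected = text[start:end]
print(selected)
```

Under this assumption, a selection of "BBB" expands to the whole string "AAABBBCCCDDDEEE.FFFFFF.GGGGGGGG", because the string contains no whitespace and so counts as a single token.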