lasigeBioTM / MER

Minimal Named-Entity Recognizer (MER)
http://labs.fc.ul.pt/mer/
56 stars 8 forks source link

Index when "\xb5" and other symbol present in text to annotate. #14

Closed LLCampos closed 7 years ago

LLCampos commented 7 years ago

Running the command

bash get_entities.sh 1 T "µ testosterone" HMDB_ChEMBL_ChEBI

Gets us:

1 T 3 15 0.59757 testosterone unknown 1

Which is wrong, since the initial index should be 2, not 3. This happens with other characters also.

This is different from https://github.com/LLCampos/IBELight/issues/11.