lasigeBioTM / MER

Minimal Named-Entity Recognizer (MER)
http://labs.fc.ul.pt/mer/

Wrong matching when period (.) is part of term #33

Closed LLCampos closed 7 years ago

LLCampos commented 7 years ago

This issue is a follow-up to an alert from BeCalm about our server responses. The data file "test_data" has the following contents:

α-amilase
dap.3

But when I do:

bash get_entities.sh 1 T DAP-3 test_data

I get:

1   T   0   5   0.378665    DAP-3   test data   1

It should not return anything. The reason is obvious: "." acts as a wildcard that also matches other special characters. So, when I do:

bash get_entities.sh 1 T 'DAP>3' test_data

I get:

1   T   0   5   0.378665    DAP>3   test data   1
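
The wildcard effect can be reproduced outside MER with plain grep. This is only a sketch: it assumes the term is used as a regular expression somewhere in the matching step, which may not be exactly how get_entities.sh works internally. The helper count_matches and the sample texts are hypothetical.

```shell
#!/usr/bin/env bash
# Term as it appears in the lexicon file from this issue.
lexicon_term='dap.3'

# Count how many of the sample texts a pattern matches (hypothetical helper).
# $1: grep mode flag (-E for regex, -F for fixed string), $2: pattern
count_matches() {
  printf '%s\n' 'DAP-3' 'DAP>3' 'dap.3' 'dap3' | grep -i -c "$1" -- "$2"
}

regex_hits=$(count_matches -E "$lexicon_term")   # "." matches any single character
fixed_hits=$(count_matches -F "$lexicon_term")   # "." is a literal period

echo "regex matches: $regex_hits"
echo "fixed-string matches: $fixed_hits"
```

In regex mode the term matches "DAP-3", "DAP>3" and "dap.3" but not "dap3", since "." needs exactly one character. In fixed-string mode only "dap.3" matches.
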

LLCampos commented 7 years ago

Going to consider this a limitation, not a bug.
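
For anyone who needs to work around the limitation, one possible approach is to escape regex metacharacters in a term before it is used as a pattern. This is a minimal sketch and not part of MER: escape_bre is a hypothetical helper, and it assumes the term ends up in a basic regular expression (the grep default).

```shell
#!/usr/bin/env bash
# Escape the characters that are special in a basic regular expression,
# so the term is matched literally. escape_bre is a hypothetical helper.
escape_bre() {
  printf '%s\n' "$1" | sed 's/[][\.*^$]/\\&/g'
}

escaped_term=$(escape_bre 'dap.3')
echo "escaped term: $escaped_term"

# With the escaped pattern, only the line containing a literal "dap.3" is counted.
escaped_hits=$(printf '%s\n' 'DAP-3' 'DAP>3' 'dap.3' | grep -i -c -- "$escaped_term")
echo "matches after escaping: $escaped_hits"
```

If the pattern is used with extended regular expressions instead, characters such as "+", "?", "(", ")", "{", "}" and "|" would need escaping as well.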