CODAIT / Identifying-Incorrect-Labels-In-CoNLL-2003

Research into identifying and correcting incorrect labels in the CoNLL-2003 corpus.
Apache License 2.0
12 stars 2 forks source link

Sports teams in test split #38

Closed alanakbik closed 3 years ago

alanakbik commented 3 years ago

Again thanks for sharing this work. The annotations look much improved over the original CoNLL. Especially sports teams that originally were not well labeled are now much better.

However, is seems that some sports teams in the test split are still labeled as LOC, but I think they should be ORG. For example, in the first few sentences we see a couple of instances like this:

SOCCER NN I-NP O
- : O O
JAPAN NNP I-NP I-ORG
GET VB I-VP O
LUCKY NNP I-NP O
WIN NNP I-NP O
, , O O
CHINA NNP I-NP I-LOC
IN IN I-PP O
SURPRISE DT I-NP O
DEFEAT NN I-NP O
. . O O

But CC O O
China NNP I-NP I-ORG
saw VBD I-VP O
their PRP$ I-NP O
luck NN I-NP O
desert VB I-VP O
them PRP I-NP O
in IN I-PP O
the DT I-NP O
second NN I-NP O
match NN I-NP O
of IN I-PP O
the DT I-NP O
group NN I-NP O
, , O O
crashing VBG I-VP O
to TO I-PP O
a DT I-NP O
surprise NN I-NP O
2-0 CD I-NP O
defeat NN I-NP O
to TO I-PP O
newcomers NNS I-NP O
Uzbekistan NNP I-NP I-LOC
. . O O

Japan NNP I-NP I-LOC
coach NN I-NP O
Shu NNP I-NP I-PER
Kamo NNP I-NP I-PER
said VBD I-VP O
...

In these examples "CHINA", "Uzbekistan" and "Japan" should be ORG (the other teams "JAPAN" and "China" in this example are labeled as ORG).

frreiss commented 3 years ago

Thanks for catching those problems in the first document of the test fold! I must confess that I went over that document by hand at least twice and managed to miss those additional corrections :-\

The instance of CHINA in the title being tagged LOC was due to there being conflicting manual corrections. The original tag for that token was PER. One file corrected it to LOC, and another corrected it to ORG. I've changed the audited files so that both change that tag to ORG.

The remaining corrections just weren't caught by our ensembles of models, and I guess our eyes skated over them while we were looking at the problems that were flagged by the models. Go figure.

I've fixed all of these in #39.

Here are the corrections for the first document in the test fold BEFORE those additional changes: image

And here is the same file AFTER, with new corrections in red and the single modified tag in blue: image

Please let us know if you see anything additional problems we missed!

frreiss commented 3 years ago

Fixed in #39

alanakbik commented 3 years ago

@frreiss awesome, thanks for correcting this!