CODAIT / Identifying-Incorrect-Labels-In-CoNLL-2003

Research into identifying and correcting incorrect labels in the CoNLL-2003 corpus.
Apache License 2.0
12 stars 2 forks source link

two misaligned/missing entities `[224, 257): ‘OCASEK GOVERNMENT OFFICE BUILDING’` and `[21, 24): ‘T&N’` #1

Closed xuhdev closed 4 years ago

xuhdev commented 4 years ago

Need to fix two warnings by the script on the training dataset:

[WARNING] Could not find [224, 257): 'OCASEK GOVERNMENT OFFICE BUILDING'
[WARNING] Could not find [21, 24): 'T&N'
xuhdev commented 4 years ago

Fixed in #3