acl-org / acl-anthology

Data and software for building the ACL Anthology.
https://aclanthology.org
Apache License 2.0
406 stars 280 forks source link

Name misspelled in anthology, possible conversion bug #71

Closed evanmiltenburg closed 6 years ago

evanmiltenburg commented 6 years ago

It seems that my name got misspelled in the COLING proceedings bib file, so instead of "Emiel van Miltenburg", it reads "Emiel Miltenburg" (dropping the 'van'). I clicked my name to see if this happened in other places as well, and found that it also happened with the IWCS proceedings.

https://aclanthology.coli.uni-saarland.de/people/emiel-miltenburg

So far, this could be a data entry problem. But: the IWCS proceedings do list my name correctly: http://www.aclweb.org/anthology/W15-0110.bib

So the system has two versions of my name for the same conference (at least for IWCS). Maybe something went wrong with the conversion between the old anthology and the new one?

I don't know if there are guards against misspelling of author names, but if there are, then having the misspelled version of my name may lead to more wrongly spelled entries.

evanmiltenburg commented 6 years ago

Ah, this seems to be related to #70. Thanks to Andreas for noticing this on Twitter. I suppose these two issues could be merged, then.

CTNLP commented 6 years ago

This is indeed the same problems as #70 I will try to fix it the same way as indicated there, we will also see if the tag is used anywhere else in the xml files. Thanks for pointing this out.

knmnyn commented 6 years ago

Just FWIW, there's no guard on name forms / variants in the Anthology as of now. We'd love to have people volunteer time to solve those problems (edit distances or some easy fix, anyone)?