We could verify that a publication was done by a particular author looking up the authors matching that name and looking at their list of publications for the publication that is being looked up. If we find the publication we have found the author.
Any other suggestions?
dedupe may be helpful if we can get some training data.
How should we deduplicate authors?
We could verify that a publication was done by a particular author looking up the authors matching that name and looking at their list of publications for the publication that is being looked up. If we find the publication we have found the author.
Any other suggestions?
dedupe may be helpful if we can get some training data.