wcmc-its / ReCiter

ReCiter: an enterprise open source author disambiguation system for academic institutions
Apache License 2.0
45 stars 23 forks source link

Feature Generator outputs in a single article suggestion pieces of two separate article records #489

Closed paulalbert1 closed 2 years ago

paulalbert1 commented 2 years ago

Problem

As reported by Drew... ReCiter suggests a single PubMed record for stm2006, which is a combination of two different records.

Screen Shot 2021-12-21 at 11 32 25 AM

Diagnosis

The record for 34591265 is correct in efetch (first author is Evelyn Ho):

The record for 34591265 is also correct in ReCiter PubMed API (first author is Evelyn Ho).

The record for 34591265 is NOT correct in ReCiter Feature Generator API (first author is James C.M. Brust).

I seem to remember trying "retrieve all records" and that didn't work. (I didn't re-try in case this is an opportunity for a bug fix.)

Solution

I have no idea why this is happening. It is interesting that 34029021 is a book, which we exclude.

paulalbert1 commented 2 years ago

This doesn't exist any more so it's kind of hard to troubleshoot. Closing for now.