emory-libraries / blacklight-catalog

1 stars 2 forks source link

SPIKE: Look at deduplication that appears to be happening in Primo VE #1198

Closed lovinscari closed 2 years ago

lovinscari commented 2 years ago

From the email chain below, there appears to be some level of deduplication happening in PrimoVE that is causing some issues. Please document the findings and any suggested improvements to resolve the issue reported.

Hi Sofia and Kat,

I’ve been following up with Kristan Majors about this issue. While I am not entirely sure, it looks like there is some type of deduplication error where a book is linked to this serial record. Can either of you look at the links provided on the View It/Get It page and help us to route this correctly? This is the Alma-generated page, and Blacklight and SOLR are not generating these links. Kristan specified that the second link (“View Full Text – online resource from Oxford Univ. Pr.”) should not be associated with this serial record.

https://emory.primo.exlibrisgroup.com/discovery/openurl?institution=01GALI_EMORY&vid=01GALI_EMORY:services&rft.mms_id=9936495435402486

Thanks, Emily


These records are not connected in ALMA in any way. It appears that some deduplication is happening in PrimoVE. We have seen other examples of things combined in Primo that were not combined before. My only assumption of why they get de-duped – the book record is incorrectly coded as Journal. I will fix that and see if it would make a difference. I am not sure how soon it would republish, so we can see if my fix worked.

Do you know?


Sofia Slutskaya Head, Resource Description Woodruff Library | Emory University 404.727.0123 | sofia.slutskaya@emory.edu

eporter23 commented 2 years ago

@lisahamlett @libah Additional info from Sofia:

So my fix from yesterday did not work and I had to investigate some more. I still believe that deduping needs a bigger discussion, but I found a temporary solution that would allow us to address tickets and concerns in a meantime.

There is a job in ALMA Prevent FRBR and/or Dedup in Discovery that could be run on sets on records to force them to stop deduping. I tested it on a record in questions and it worked - https://emory.primo.exlibrisgroup.com/discovery/openurl?institution=01GALI_EMORY&vid=01GALI_EMORY:services&rft.mms_id=9936495435402486

So at least this one little issue is no longer an issue.

The information about de-duping in Primo VE is at https://knowledge.exlibrisgroup.com/Primo/Product_Documentation/020Primo_VE/Primo_VE_(English)/090Dedup_and_FRBR_for_Primo_VE/010Understanding_the_Dedup_and_FRBR_Processes_(Primo_VE)

The instructions about supressing de-duping is on the very bottom of this page.

libah commented 2 years ago

Informational: New England Journal of Medicine is appearing in Primo VE . You can see it here in sandbox: https://emory-psb.primo.exlibrisgroup.com/discovery/openurl?institution=01GALI_EMORY&vid=01GALI_EMORY:bernardo&rft.mms_id=9936495558502486

libah commented 2 years ago

Met with Sophia today and turned off deduping for the "services" view in primo ve/Alma. We tested several titles that are now appearing as expected.. "Natural History" results no longer show Oxford and appear as expected.

Sophia's examples:

Journal of vocational behavior 0001-8791

A Living Systems Theory of Vocational Behavior and Development

Shuggie Bain : a novel

The 1619 Project : a new origin story

lovinscari commented 2 years ago

I am going to close this ticket since Ann met with Sofia to review and turning off deduping in PrimoVE seems to have resolved the initial issue presented.