emory-libraries / dlp-selfdeposit

0 stars 0 forks source link

Review MODS publisher entries and defaults #565

Closed eporter23 closed 2 weeks ago

eporter23 commented 1 month ago

In our original sample data set of 59 works, the majority of Publisher entries are defaulting to "Emory University Libraries" which was identified as our fallback entry if no publisher was supplied, even though some have existing entries in the mods:publisher statement. This sample set included a mix of all the different Publication Types. We have also done 2 larger loads of data since the original set was loaded.

We can expect that Presentations and Posters will likely not have formal Publishers identified. However, Books and Book Chapters and Conference Papers should have Publishers in their MODS records.

To determine: is this due to the mapping identified in the original work, or some other cause?

Examples: Book Chapters Books Conferences (aka Conference Papers)

Specific examples, in which you can see the MODS values for publisher defaulting to "Emory University Libraries"

https://oe24-test.libraries.emory.edu/concern/publications/913b669d-158b-402e-a5b7-f96f93e48115?locale=en See original: https://open.library.emory.edu/publications/emory:vpnf0/

https://oe24-test.libraries.emory.edu/concern/publications/aadb81b4-5510-4746-983f-58e94907b199?locale=en See original: https://open.library.emory.edu/publications/emory:thgdz/

https://oe24-test.libraries.emory.edu/concern/publications/15080c97-3ed6-4c72-a45b-cff29460a09e?locale=en See original: https://open.library.emory.edu/publications/emory:s6nvm/

eporter23 commented 4 weeks ago

SOLR results identifying the default "Emory University Libraries" entry added for Publications after ~1800 publications (mostly Articles) have been loaded into oe24-test: 47 total

eporter23 commented 2 weeks ago

@bwatson78 I am thinking maybe the best option would be to remove the default assignment of "Emory University Libraries" if a publisher entry doesn't exist in the expected MODS location. Let me know your thoughts, and I'll write up another ticket if needed to proceed. My thought is, it may be easier to identify this after the data is loaded and hopefully the lack of Publisher entries won't be present in too many records.

eporter23 commented 2 weeks ago

To do: write up a new ticket for removing the auto-assignment of Publisher.