pulibrary / bibdata

Local API for retrieving bibliographic and other useful data from Alma (Ruby 3.2.0, Rails 7.1.3.4)
BSD 2-Clause "Simplified" License
16 stars 7 forks source link

Index ICPSR subject headings #2035

Open escowles opened 2 years ago

escowles commented 2 years ago

Request received and approved by DSSG:

In Voyager, ICPSR subject headings were indexed. In ALMA, they are not.

An example is https://catalog.princeton.edu/catalog/9949867353506421/staff_view While keywords, they provide important additional access to records. ICPSR uses a controlled vocabulary. We have traditionally not included Library of Congress Subject Headings since they were not included in the record.

They are part of the 650 field in the Marc record and have the subfield of 7.

Since keyword searching is the most common, could we once again have these indexed in the Catalog (ALMA) as they were in Voyager?

Re-index required?

kevinreiss commented 1 year ago

Reach out to Bobray to understand this a bit further. May be a question about seeing these in the Alma backend.

mzelesky commented 1 year ago

See https://docs.google.com/spreadsheets/d/1kn4ixrZcvjdP3UPG1zL2ytfFO16OUN77ZWyT9ous9hM/edit?usp=sharing for an analysis of the frequency of each ICPSR heading.

Some cleanup of the MARC records should happen alongside this effort, to make the $2 values consistent (the official value is icpsr in lowercase). Also, some errant $0 values are in the data (LCSH URIs even though they are ICPSR headings).

kevinreiss commented 1 year ago

Mark estimates the clean-up work is an hour or two.

kevinreiss commented 1 month ago

@mzelesky is the clean-up mentioned in the last comment something that could happen in the next 4-6 weeks? We'd like to possibly add this ticket to a sprint on OL features in December?