pulibrary / bibdata

Local API for retrieving bibliographic and other useful data from Alma (Ruby 3.2.0, Rails 7.1.3.4)
BSD 2-Clause "Simplified" License

Record created without format #2134

Closed by maxkadel 1 year ago

maxkadel commented 1 year ago

Expected behavior

All works should have a "format" in their Solr document

Actual behavior

Work is indexed without a format - see

Steps to replicate

Index SCSB-7935196 to Solr

Impact of this bug

All works should have a format; if one does not, indexing should raise an error that brings the work to the attention of catalogers. Without a format, users may have difficulty discovering the work and understanding how to use or access it.
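As a rough illustration of the error described above, here is a minimal sketch of a check that could run on a Solr document before it is sent to the index. The class `MissingFormatError` and the method `ensure_format!` are hypothetical names for this example, not part of bibdata, and the sketch assumes the document is a plain Hash with the format stored under the `"format"` key.

```ruby
# Hypothetical error class and helper; the names are illustrative only
# and do not come from the bibdata codebase.
class MissingFormatError < StandardError; end

# Returns the document unchanged when it carries at least one non-blank
# format value; raises MissingFormatError otherwise.
# Assumes solr_doc is a Hash and the format lives under the "format" key.
def ensure_format!(solr_doc)
  formats = Array(solr_doc["format"]).reject { |f| f.to_s.strip.empty? }
  return solr_doc unless formats.empty?

  raise MissingFormatError, "Record #{solr_doc['id']} has no format"
end

# Usage: report the problem instead of indexing silently.
begin
  ensure_format!({ "id" => "SCSB-7935196", "title_display" => "Analectas" })
rescue MissingFormatError => e
  warn e.message
end
```

How the error would actually reach catalogers (a log, a report, an alert) is left open here.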

Re-index required?

Implementation notes, if any

maxkadel commented 1 year ago

Solr document at the time of ticket creation

{
  "id": "SCSB-7935196",
  "numeric_id_b": false,
  "other_id_s": [
    ".b106574619"
  ],
  "author_citation_display": [
    "Freemasons"
  ],
  "author_roles_1display": "{\"secondary_authors\":[\"Freemasons\"],\"translators\":[],\"editors\":[],\"compilers\":[]}",
  "author_s": [
    "Freemasons. Santo Domingo. Logia Cuna de América, no. 2."
  ],
  "title_display": "Analectas; periódico ecléctico de difusión cultural.",
  "title_t": [
    "Analectas; periódico ecléctico de difusión cultural."
  ],
  "title_citation_display": [
    "Analectas; periódico ecléctico de difusión cultural"
  ],
  "compiled_created_t": [
    "Analectas; periódico ecléctico de difusión cultural."
  ],
  "pub_created_display": [
    "Santo Domingo."
  ],
  "pub_created_s": [
    "Santo Domingo."
  ],
  "pub_citation_display": [
    "Santo Domingo"
  ],
  "description_display": [
    "7 v. 36 cm."
  ],
  "description_t": [
    "7 v. 36 cm."
  ],
  "number_of_pages_citation_display": [
    "7 v."
  ],
  "notes_display": [
    "Microfilm.",
    "Weekly.",
    "Issued by the Club masónico Cuna de América.",
    "No more published?"
  ],
  "language_name_display": [
    "English"
  ],
  "language_facet": [
    "English"
  ],
  "language_code_s": [
    "eng"
  ],
  "language_iana_s": [
    "en"
  ],
  "related_name_json_1display": "{\"Related name\":[\"Freemasons. Santo Domingo. Logia Cuna de América, no. 2.\"]}",
  "lccn_display": [
    "44017511"
  ],
  "lccn_s": [
    "44017511"
  ],
  "holdings_1display": "{\"8076325\":{\"location_code\":\"scsbnypl\",\"location\":\"Remote Storage\",\"library\":\"ReCAP\",\"call_number\":\"*DS+ (Analectas)\",\"call_number_browse\":\"*DS+ (Analectas)\",\"items\":[{\"holding_id\":\"8076325\",\"enumeration\":\"v. 1-4 (July 1933-June 1934)\",\"id\":\"13278989\",\"status_at_load\":\"Available\",\"barcode\":\"33433088591924\",\"copy_number\":\"1\",\"use_statement\":\"In Library Use\",\"storage_location\":\"RECAP\",\"cgd\":\"Shared\",\"collection_code\":\"NA\"},{\"holding_id\":\"8076325\",\"enumeration\":\"v. 5 (July-Sept. 1934)\",\"id\":\"13278990\",\"status_at_load\":\"Available\",\"barcode\":\"33433088591932\",\"copy_number\":\"1\",\"use_statement\":\"In Library Use\",\"storage_location\":\"RECAP\",\"cgd\":\"Shared\",\"collection_code\":\"NA\"}],\"location_has\":[\"v. 1-4 (July 1933-June 1934), v. 5 (July-Sept. 1934)\"],\"supplements\":[null],\"indexes\":[null]}}",
  "recap_notes_display": [
    "N - S"
  ],
  "location_code_s": [
    "scsbnypl"
  ],
  "location": [
    "ReCAP"
  ],
  "location_display": [
    "Remote Storage"
  ],
  "advanced_location_s": [
    "scsbnypl",
    "ReCAP"
  ],
  "call_number_display": [
    "*DS+ (Analectas)"
  ],
  "call_number_browse_s": [
    "*DS+ (Analectas)"
  ],
  "hashed_id_ssi": "429a335a0768c05b",
  "_version_": 1761538598635569152,
  "timestamp": "2023-03-27T16:21:58.790Z"
}

kevinreiss commented 1 year ago

@mzelesky not urgent, but could you give this a quick look in the data from SCSB? This is an odd case.

mzelesky commented 1 year ago

The leader on the record is corrupt. To get more detail, I will have to look at the last full dump that has the record.

However, it looks like several characters are missing from the leader: positions 5-6, as well as position 8.

As a result, the character that determines the format (the 's', which marks a serial when it sits in leader position 07) is in the wrong place.

01334s a2200337 a 4500
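
To make the diagnosis above concrete, here is a minimal sketch of a leader sanity check in plain Ruby. The method `leader_problems` is a hypothetical helper, not something from bibdata or a MARC library. The code lists are the MARC 21 bibliographic values for leader positions 06 and 07. The corrupt leader is used exactly as quoted in this thread (whitespace may have been collapsed when it was pasted), and the well-formed leader is a made-up example for comparison, not the record's real original.

```ruby
# Hypothetical helper, not part of bibdata: checks the parts of a MARC 21
# bibliographic leader that format detection depends on.
LEADER_LENGTH = 24
# Type-of-record codes allowed in leader position 06.
RECORD_TYPES = %w[a c d e f g i j k m o p r t].freeze
# Bibliographic-level codes allowed in leader position 07 ("s" = serial).
BIB_LEVELS = %w[a b c d i m s].freeze

# Returns an array of human-readable problems; empty when none are found.
def leader_problems(leader)
  problems = []
  unless leader.length == LEADER_LENGTH
    problems << "length is #{leader.length}, expected #{LEADER_LENGTH}"
  end
  unless RECORD_TYPES.include?(leader[6])
    problems << "position 06 (type of record) is #{leader[6].inspect}"
  end
  unless BIB_LEVELS.include?(leader[7])
    problems << "position 07 (bibliographic level) is #{leader[7].inspect}"
  end
  problems
end

# Leader as quoted in this thread.
puts leader_problems("01334s a2200337 a 4500")
# Made-up well-formed serial leader, for comparison only.
puts leader_problems("01334nas a2200337 a 4500").empty?
```

Note that position 07 of the corrupt leader happens to hold a valid code by accident, so a check on that position alone would not catch this record; the length and position 06 checks do.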