psu-libraries / scholarsphere-3

A web application for ingest, curation, search, and display of digital assets. Powered by Hydra technologies (Rails, Hydra-head, Blacklight, Solr, Fedora Commons, etc.)
Apache License 2.0
78 stars 24 forks source link

remove empty string from resource_type #1617

Closed srerickson closed 4 years ago

srerickson commented 5 years ago

The blank resource type (#1396) is still lingering in some objects' resource type field. For example:

{
        "has_model_ssim":["GenericWork"],
        "id":"5712m6699",
        "resource_type_tesim":["", "Other"]}]
  }

This csv (which I've renamed to .txt to agree with Github's restrictions on files that can be attached to issues) includes a list of objects that need to be fixed, along with the corrected value for the resource type. Columns are: id, has_model_ssim, OLD_resource_type_tesim, NEW_resource_type_tesim. Hopefully this file can be used for a batch update.

awead commented 5 years ago

@srerickson I went ahead and updated the resources in Scholarsphere listed in the CSV. Re-run the query and see if we've got them all.

srerickson commented 5 years ago

Looks good. I don't see any Works the the empty resource_type. "unspecified" is also gone as an option for the resource type on the search page, which is great.