sul-dlss / was_robot_suite

Robots for Web Archiving Service accessioning and dissemination
Other
0 stars 2 forks source link

Update content type for Web Archive Crawl objects #486

Open edsu opened 2 years ago

edsu commented 2 years ago

Due to improvements in wasCrawlDissemination it is now possible to browse Web Archive content using the webarchive-binary content-type:

https://argo.stanford.edu/catalog?f%5Bcontent_type_ssim%5D%5B%5D=webarchive-binary

However prior to this change this type was being overwritten with content type file. You can see some of these here:

https://argo.stanford.edu/catalog?f%5Bcontent_type_ssim%5D%5B%5D=file&q=web+archive&search_field=text

For consistency and to ease discoverability we should rewrite these content-types to be webarchive-binary.

mjgiarlo commented 2 years ago

@andrewjbtw believes this could be handled via a bulk action. Would need work in Argo.