psu-libraries / scholarsphere-3

A web application for ingest, curation, search, and display of digital assets. Powered by Hydra technologies (Rails, Hydra-head, Blacklight, Solr, Fedora Commons, etc.)
Apache License 2.0
78 stars 24 forks source link

Collect File Information for Contents of Zips/Tars #1578

Open srerickson opened 5 years ago

srerickson commented 5 years ago

There are a fair number of .zip, .tar.gz and other compressed file archives in ScholarSphere, particular among the data sets. It would be useful for users, curators, and administrators to have a sense of what's in them without having to download and open the files. Ideally, we would extract a manifest of the contents of archive files when they are uploaded and save it as file set metadata.

awead commented 5 years ago

I think this might be a good enhancement for version 4. It's possible we could do this in the current version, but might require some additional work that wouldn't be portable to the next version.