sul-dlss / happy-heron

Self-Deposit for the Stanford Digital Repository (SDR): H2 is a Rails web application enabling users to deposit scholarly content into SDR
Apache License 2.0
10 stars 2 forks source link

Research changes in H2 for GIS deposit #3456

Open edsu opened 5 months ago

edsu commented 5 months ago

@kimdurante has a user who would like to deposit GIS items potentially via H2. This ticket is to research what changes would be needed to allow an item deposited via H2 to be accessioned using gisAssemblyWF. Some things to consider include:

amyehodge commented 5 months ago

@edsu

kimdurante commented 5 months ago

Geospatial specific metadata (bounding boxes, projections, geometry types, and file formats) can be extracted from the data directly. One thing we would need to account for is the creation of ArcGIS/ISO metadata, but we could probably create that as well. Something else we would have to consider is that, currently, GIS data are structured as: 1 layer = 1 druid, so the deposit process would have to allow for that.

amyehodge commented 5 months ago

@kimdurante We should consider where/when that extraction would occur. The H2 database is the version of record for all items deposited via H2 (except in a few cases, but then depositors can never use H2 again to update those items). If the extraction occurs after H2 deposit ("outside" of H2), this would make these items another exception to the rule that staff then need to maintain for depositors. If it happens "inside" H2, then we need to be able to store that metadata in H2. Just something I want to be sure people are thinking about. I'm generally opposed, as I think is @andrewjbtw, to anything that creates more exceptions to the rule that users can maintain or themselves.

Edit: I'm thinking about this again and realizing that if the extraction happens "outside" H2 but every time, this may be fine? Basically, I just want to make sure these workflows are carefully considered so we don't end up with more exceptions to the rule or objects that have a tendency to get stuck when versioned etc.