Open coltongrainger opened 5 years ago
I'd like to incorporate updates to metadata from transcription or OCR with this issue.
Talked with Philip today; see https://unix.stackexchange.com/questions/2161/rsync-filter-copying-one-pattern-only/2503#2503 for an rsync include/exclude format.
I'd like to be prepared to extract metadata from
I'm looking for a data exchange format specification now. (Ideally JSON before injecting to SQL?)
I wrote out functions for end users to recursively assign UUIDs and create CSV templates here: scripts/2019-06-26-data-exchange-formatting.py.
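For anyone reading along without the script open, here is a minimal sketch of what "recursively assign UUIDs and create CSV templates" could look like. It is not the code in the script above: the function names `assign_uuids` and `write_csv_template` and the column names `uuid`, `path`, `title`, and `date` are all hypothetical, chosen only for illustration.

```python
import csv
import uuid
from pathlib import Path


def assign_uuids(root):
    """Walk root recursively and map each file's relative path to a new UUID."""
    root = Path(root)
    return {
        path.relative_to(root).as_posix(): str(uuid.uuid4())
        for path in sorted(root.rglob("*"))
        if path.is_file()
    }


def write_csv_template(ids, out_path, fields=("uuid", "path", "title", "date")):
    """Write one row per file, leaving the descriptive columns blank.

    The column names are placeholders (assumed), not a fixed schema.
    """
    with open(out_path, "w", newline="") as handle:
        writer = csv.DictWriter(handle, fieldnames=fields)
        writer.writeheader()
        for rel_path, file_id in ids.items():
            row = dict.fromkeys(fields, "")  # blank cells for the end user to fill in
            row.update({"uuid": file_id, "path": rel_path})
            writer.writerow(row)
```

Something like `write_csv_template(assign_uuids("platform"), "template.csv")` would then give end users one blank row per image to fill in. Random (version 4) UUIDs are used here, so rerunning the function assigns new identifiers; persisting the mapping would be a separate step.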
Sam mentioned that harvesting metadata from a directory structure, e.g.,
platform
|
document
|
image (files)
could be achieved if data from archive is denormalized and placed redundantly throughout the directories, or if there's a separate archive.csv file against which to make foreign key references.
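A rough sketch of the second option (a separate archive.csv used as a lookup table), assuming the layout is exactly platform/document/image and that archive.csv has a `document` column matching the document directory names. That column name, and the function names `load_archive` and `harvest`, are assumptions for illustration only.

```python
import csv
from pathlib import Path


def load_archive(archive_csv, key="document"):
    """Index archive.csv rows by the (assumed) key column."""
    with open(archive_csv, newline="") as handle:
        return {row[key]: row for row in csv.DictReader(handle)}


def harvest(root, archive):
    """Build one record per image from platform/document/image paths."""
    root = Path(root)
    records = []
    for image in sorted(root.glob("*/*/*")):
        if not image.is_file():
            continue
        platform, document, filename = image.relative_to(root).parts
        record = {"platform": platform, "document": document, "image": filename}
        # Foreign key lookup: the document directory name is matched against
        # the key column of archive.csv; unmatched documents keep path data only.
        record.update(archive.get(document, {}))
        records.append(record)
    return records
```

The resulting list of dicts can be passed straight to `json.dumps` if JSON ends up being the exchange format. Keeping archive.csv separate means each archive-level fact is stored once rather than copied into every directory.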
Importing logbooks could look like:
Eventually, I'd like updates to this prototype to be compatible with methods Zaihua Ji describes here: https://sea.ucar.edu/conference/2012/operational-dataset-update-RDA.