Open awagner-mainz opened 5 years ago
This is closely related to a different (and not yet solved or even documented) issue: how to cite the research data as a whole, or particular works. Do we cite them using a version number? If so, do we use the version number of the research data package, the version number of a particular TEI dataset (even if citing the HTML version, for example)? When is the version number updated: when updating the TEI, or when all webhooks have completed? (If we use dates, similar questions may arise.)
I think this issue/ticket needs much more discussion. For example, webdata derived from the TEI data are still listed as desiderata for public archival, just like the TEI data themselves. But I thought we had (informally) decided to focus on archiving the TEI rather than the webdata? If so, we should remove these items from the "data to store".
In any case we can split the derivative data out of this issue. How we publish (versions of) derivative data is IMHO indeed something we need to discuss and plan for a longer term only.
I am eliminating them from the list above - and for now I refrain from adding a new ticket, pending discussion in the project.
I am (more or less, just need to find the time) about to write a github/zenodo integration that parses individual TEI files from a github release, extracts their metadata and ingests them individually to zenodo. Will have to fiddle around with piping zenodo's DOI back into the files, among other things.
But will that be even any help for the webapp? Is it related to this issue?
Finally, I suggest we upload the TEI files we have published already to individual zenodo records manually for now...
... (either gitlab or github)
Data to store:
and Simple)Plaintext filesRDF (XML, TTL, JSON-LD)iiif ManifestsMETS filespdf and ebook filesAnd I suggest we store only data for full works (and volumes), not passages.