Edelweiss / hgv

Heidelberger Gesamtverzeichnis der griechischen Papyrusurkunden Ägyptens
MIT License

Images for Digital Bodleian #154

Closed (samosafuz closed this issue 1 year ago)

samosafuz commented 1 year ago

The Bodleian Library in Oxford has encoded its catalogue of Western Manuscripts in TEI XML and has published the repo on GitHub. The XML is well structured: it includes TM numbers for ancient texts as well as links to the Digital Bodleian portal where such links are available. It is therefore now possible (and desirable) to automate the updating of image links in HGV and DCLP.

Could you work up a script for doing so? The pertinent folders in the repo will be those with the prefixes Gr_ (for Greek, e.g., /collections/Gr_bib) or Lat_ (for Latin, e.g., /collections/Lat_bib).
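As a starting point, here is a minimal sketch of how the pertinent files might be collected from a local clone. The function name `bodleian_tei_files`, the `.xml` extension, and the assumption that catalogue files sit somewhere beneath each `Gr_*` or `Lat_*` folder are illustrative guesses, not taken from the Bodleian repo's documentation.

```python
from pathlib import Path


def bodleian_tei_files(repo_root):
    """Yield XML files from collections/ folders prefixed Gr_ or Lat_.

    Assumes a local clone of the catalogue repo at repo_root and that
    the catalogue records use the .xml extension (both are assumptions).
    """
    collections = Path(repo_root) / "collections"
    for prefix in ("Gr_", "Lat_"):
        # Match folders such as collections/Gr_bib or collections/Lat_bib.
        for folder in sorted(collections.glob(prefix + "*")):
            if folder.is_dir():
                # Search recursively in case files are nested in subfolders.
                yield from sorted(folder.rglob("*.xml"))
```

Calling `list(bodleian_tei_files("path/to/clone"))` would then give the files to scan; folders with other prefixes are skipped.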

It shouldn't be much trouble to compile the pertinent data: so, for example, in the file for MS. Gr. class. d 100 (p) one finds

                     <bibl subtype="full" type="digital-facsimile">
                        <ref target="https://digital.bodleian.ox.ac.uk/objects/3f5a2ed9-9d4d-4032-94c8-7d6c0fa259b3/">
                        <title>Digital Bodleian</title></ref> <note>(full digital facsimile)</note>
                     </bibl>

as well as a pair of TM numbers:

                  <altIdentifier type="external">
                     <idno type="TM">20349</idno>
                  </altIdentifier>
                  <altIdentifier type="external">
                     <idno type="TM">20343</idno>
                  </altIdentifier>

It's a shame these links don't take the form of DOIs or other stable identifiers, but we can hope that they're going to be around for a while.
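To illustrate the extraction described above, here is a rough sketch using only Python's standard library. The function name `extract_links` is hypothetical, and the code makes two assumptions: that element names can be matched while ignoring any XML namespace, and that every TM number in a file may be paired with every digital-facsimile link in that file. The second assumption may not hold for records describing several distinct parts. Writing the links into the HGV and DCLP files is not covered here.

```python
import xml.etree.ElementTree as ET


def local_name(tag):
    """Strip a '{namespace}' prefix from an element tag, if present."""
    return tag.rsplit("}", 1)[-1]


def extract_links(xml_text):
    """Return (tm_numbers, facsimile_urls) found in one catalogue record."""
    root = ET.fromstring(xml_text)
    tm_numbers = []
    urls = []
    for el in root.iter():
        name = local_name(el.tag)
        # <idno type="TM">20349</idno>
        if name == "idno" and el.get("type") == "TM" and el.text:
            tm_numbers.append(el.text.strip())
        # <bibl type="digital-facsimile"><ref target="..."/></bibl>
        elif name == "bibl" and el.get("type") == "digital-facsimile":
            for ref in el.iter():
                if local_name(ref.tag) == "ref" and ref.get("target"):
                    urls.append(ref.get("target"))
    return tm_numbers, urls


# Sample built from the snippets quoted above, wrapped in a dummy root.
SAMPLE = """<record>
  <altIdentifier type="external"><idno type="TM">20349</idno></altIdentifier>
  <altIdentifier type="external"><idno type="TM">20343</idno></altIdentifier>
  <bibl subtype="full" type="digital-facsimile">
    <ref target="https://digital.bodleian.ox.ac.uk/objects/3f5a2ed9-9d4d-4032-94c8-7d6c0fa259b3/">
    <title>Digital Bodleian</title></ref> <note>(full digital facsimile)</note>
  </bibl>
</record>"""

tm_numbers, urls = extract_links(SAMPLE)
print(tm_numbers, urls)
```

Matching on local names keeps the sketch working whether or not the files declare the TEI namespace on their root element.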

jcowey commented 1 year ago

https://github.com/papyri/idp.data/pull/333

Now merged.