knaw-huc / textannoviz

GNU General Public License v3.0

Download/export individual plaintext transcripts, PageXMLs, and page images #63

Open kintopp opened 8 months ago

kintopp commented 8 months ago

Implement a way (from the detail view, and perhaps also from the search results view) to download or export individual plaintext transcripts, PageXML files, and page images via links. Multiple users have requested this.
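As a rough illustration of the request, here is a minimal sketch of how per-page export links could be generated. Everything in it is an assumption made for illustration: the `build_export_links` helper, the `ExportLink` type, the `<base>/<page id>.<ext>` URL layout, and the `jpg` image format are hypothetical and are not part of textannoviz or its backend.

```python
import re
from dataclasses import dataclass
from urllib.parse import quote


@dataclass(frozen=True)
class ExportLink:
    """One downloadable artefact for a single page (hypothetical type)."""
    label: str      # text shown to the user
    url: str        # where the file would be fetched from
    filename: str   # suggested name for the saved file


# Assumed export formats: (label, file extension). The image format is a guess.
EXPORT_FORMATS = [
    ("Plaintext transcript", "txt"),
    ("PageXML", "xml"),
    ("Page image", "jpg"),
]


def build_export_links(base_url: str, page_id: str) -> list[ExportLink]:
    """Build one link per export format for the given page.

    Assumes a hypothetical URL layout of <base_url>/<page_id>.<ext>.
    """
    base = base_url.rstrip("/")
    # Percent-encode the id for the URL, including any slashes.
    url_id = quote(page_id, safe="")
    # Replace characters that are awkward in file names with underscores.
    file_id = re.sub(r"[^A-Za-z0-9._-]", "_", page_id)
    return [
        ExportLink(label, f"{base}/{url_id}.{ext}", f"{file_id}.{ext}")
        for label, ext in EXPORT_FORMATS
    ]
```

The same list could back both the detail view and the search results view, since it depends only on a page identifier.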

kintopp commented 7 months ago

https://globalise.canny.io/transcriptions-viewer/p/you-cant-download-page-images-or-transcripts-to-your-computer

marijnkoolen commented 1 week ago

For REPUBLIC we would like this as well. What the PageXML export should contain is not yet clear in the REPUBLIC case, since we show resolutions as units, and these don't have a direct PageXML representation yet. I'm planning to implement this in PageXML-tools, so that any document can be turned into a PageXML representation (similar to the current .json property that turns any document into JSON), but I won't have time to do this before Oct/Nov. In the meantime we can offer e.g. JSON of the resolutions, or PageXML of the entire scan(s) associated with a resolution. That might be frustrating for users, but it's better than nothing.