buda-base / lds-queries

A repository for BUDA Linked Data Server
Apache License 2.0
0 stars 1 forks source link

query to generate IGRID metadata from BUDA for IA #88

Closed TBRC-TimB closed 1 year ago

TBRC-TimB commented 2 years ago

The last bit of metadata we still generate from tbrc.org for the IA process is the image group level metadata. we need one file for each Volume in a Work. Usually we generate these by looping through the IGRIDs and making individual calls to tbrc.org.

Sample data: Typically the files are .xml but github blocks that file format. I1KG21234.txt I1KG21234.zip

eroux commented 2 years ago

Here's a query you can use for the script : https://ldspdi.bdrc.io/query/table/AO_IGinfoInW?R_RES=bdr:W1PD96684

it lists the volume IDs, volume number, number of tbrc intro pages (which should in theory be used in the php to determine the thumbnail) and the total number of pages (which will be < 3 for an unsynced volume). The list is ordered by volume number.

One potential benefit is that IIRC the IA script just takes the xml order of the image groups as their actual order, which isn't always true (see this example with volume 62 at the end), while this query sorts by volume number

That will probably require some adaptations to the script, but BUDA doesn't support tbrc.org xml files

eroux commented 2 years ago

note that the image list is not in the database so you'll have to get it from the dimensions.json file (or through the API call). In the future we also might want to consider listing images with their corrected order and pagination from the bvm, that would add some value over our current upload... but it's a little more complex, we can keep it for later