OpenPecha / prodigy-tools

Tools for OpenPecha's use of Prodigy
MIT License
0 stars 1 forks source link

Sampling Images for layout annotation #8

Closed 10zinten closed 1 year ago

10zinten commented 1 year ago
kaldan007 commented 1 year ago

top offset can be look at the property volumePagesTbrcIntro in the database, for instance in http://purl.bdrc.io/resource/I0886.ttl for the bottom we don't have any other option then using list spliting. To get image list we can either look on S3 or through https://iiifpres.bdrc.io/il/v:bdr:{img_grp_id}

kaldan007 commented 1 year ago

I think its better to go with 10 from top and 10 from bottom as we can ignore blank image using the ignore button.

ta4tsering commented 1 year ago

okay will look into it

ta4tsering commented 1 year ago

the link below is the script that takes a work_id, gives back a dict with imagegroup as key and s3_key of the top 10 and bottom 10 images from that imagegroup, excluding the tbrc intro images from the top. image sampling for layout analysis