Closed keithamoss closed 8 years ago
What else have they stored in the JPEG metadata?
At least resolution (PPI), but what else?!
Use SQLite/PostgreSQL rather than JSON to make querying easier.
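A minimal sketch of what the PostgreSQL side could look like, assuming each JSON record is loaded into a single `jsonb` column named `fields` (the name the queries in this issue use). The `id` column, the `jsonb` type, and the index name are assumptions for illustration, not the actual schema.

```sql
-- Assumed layout: one row per digital object, raw JSON kept in "fields".
CREATE TABLE IF NOT EXISTS sro_digital_objects_collection (
    id     serial PRIMARY KEY,  -- hypothetical surrogate key
    fields jsonb NOT NULL       -- the original JSON record
);

-- Expression index so lookups and joins on the filename
-- don't have to scan every row (index name is hypothetical).
CREATE INDEX IF NOT EXISTS sro_do_filename_idx
    ON sro_digital_objects_collection ((fields->>'filename'));
```

Keeping the JSON intact in one column means nothing is lost on import, while `->>` still allows filtering and joining on individual keys.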
Need to exclude fieldbooks
SELECT "do".fields->'filename' AS do_filename,
       "do".fields->'filesize' AS do_filesize,
       img.filename,
       img.filesize,
       img.id AS img_id
FROM sro_digital_objects_collection AS "do"
JOIN sro_images AS img
  ON "do".fields->>'filename' = replace(img.filename, ' ', '_');
SELECT "do".fields->'filename' AS do_filename
FROM sro_digital_objects_collection AS "do"
WHERE NOT EXISTS (
    SELECT 1
    FROM sro_images AS img
    WHERE "do".fields->>'filename' = replace(img.filename, ' ', '_')
);
From a quick comparison of file sizes, it looks like they match pretty closely. Calling this done!
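For reference, a sketch of how the file size comparison could be run directly in PostgreSQL. It assumes `fields->>'filesize'` holds an integer byte count that casts cleanly to `bigint`, and that `sro_images.filesize` is a numeric column in the same unit; neither is confirmed here.

```sql
-- List matched files with the largest size discrepancies first.
SELECT img.id AS img_id,
       img.filename,
       ("do".fields->>'filesize')::bigint AS do_filesize,   -- assumed: bytes, stored as text in the JSON
       img.filesize AS img_filesize,                         -- assumed: bytes, numeric column
       abs(("do".fields->>'filesize')::bigint - img.filesize) AS size_diff
FROM sro_digital_objects_collection AS "do"
JOIN sro_images AS img
  ON "do".fields->>'filename' = replace(img.filename, ' ', '_')
ORDER BY size_diff DESC;
```

If the top rows show a `size_diff` of zero or close to it, the files on hand are very likely the same ones SRO holds.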
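SELECT MIN(width) AS min_width, MIN(height) AS min_height,
       MAX(width) AS max_width, MAX(height) AS max_height,
       AVG(width) AS avg_width, AVG(height) AS avg_height,
       -- PostgreSQL has no built-in MEDIAN(); percentile_cont(0.5) is the equivalent
       percentile_cont(0.5) WITHIN GROUP (ORDER BY width) AS median_width,
       percentile_cont(0.5) WITHIN GROUP (ORDER BY height) AS median_height
FROM sro_images;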
Now let's stitch the two together to see whether the files we have can be associated with SRO's digital objects collection.
Dependencies: #10 Next: #9
Process
Potential Challenges