wellcomecollection / alpha

Alpha version of a catalogue explorer for Wellcome Library (deprecated).
0 stars 0 forks source link

Update digitised status of records #45

Closed frankieroberto closed 8 years ago

frankieroberto commented 8 years ago

Some records will have been digitised since last time (and others we missed last time), so we should re-discover all the digitised assets.

Previously we did this by searching the database for records with metadata->'759'->0->>'a' LIKE '%dig%' AND NOT(metadata ? '856') (i.e. a 'digcode'), and then trying to download the JSON asset 'package' at http://wellcomelibrary.org/package/#{record_id}/

However we should be able to improve on this process by looking at the list of digitised B-numbers available at http://wellcomelibrary.org/resource/collections/access/all-open/ and downloading the packages for just those instead (possibly skipping any for which we have already downloaded the package).

frankieroberto commented 8 years ago

This has run, and we now have 93,257 digitised records.