Closed pdaengeli closed 4 weeks ago
The IIIF server isnn't coping well with the bags created so far, likely due to their size.
Thus adjusting the strategy:
bogabag_0056
containing Sheet-0056000
through Sheet-0056877
)The first try was successful and took ca 4 hours for just the IIIF ingest process (bogabag_0056
)
find . -name 'Sheet-0056*.tif' | wc -l
) and compare completeness of the ingest (delivery counts are given above)
NB: needs to run over all files as there is a structural mismatch
Edit: doing this as diffs of file lists as it is more reliable/informative than countsget list of resstore files:
ssh dh@130.92.252.28 find /tmp/boga_herbarium_bernense/Picturae -name \'Sheet-00*.tif\' -printf \'%f\\n\' | sort > comparison/tif-list-all-remote.txt
get metadata from IIIF server:
sort column G and paste contents to comparison-tif-list-all-local.txt
diff comparison/tif-list-all-remote.txt comparison/tif-list-all-local.txt > comparison/comparison-all.txt
A clean diff :)
Thus skipping remaining tests and closing here.
Goal
Ingest of
tif
images fromresstore.unibe.ch/boga_herbarium_bernense/Picturae
toiiif.ub.unibe.ch
.Target structure:
Sample requests (collection slug + file name):
Process flow
rsync
with--include-from include.txt --exclude '*'
to selectively copy fromresstore
to NASfor file in **/*.tif; do dir=$(echo $file | cut -d. -f1); mkdir -p $dir; mv $file $dir; done
)sudo bagit.py --contact-name 'XY' --md5 bogabag_00XXX
sudo rsync -r bogabag_00* /volume1/docker/delivery/
)rm -r /volume1/docker/delivery/_processed/bogabag_00XXX
) + clean-up scratch areaProgress control
Fetching to NAS and restructuring
bogabag_00000-000219
)bogabag_00021-00046
)bogabag_00046-00074
)bogabag_00074-00105
)bogabag_00105-00132
)BOX-0000158
(Sheet-0015350.tif
), done, but in differing structure;bogabag_00132-00164
)bogabag_00164-00205
)bogabag_00205-00238
)bogabag_00238-00272
)bogabag_00272-00306
)bogabag_00306-00336
)bogabag_00336-00366
)bogabag_00366-00440
)bogabag_00440-00445
)bogabag_00445-00464
)bogabag_00464-00497
)bogabag_00497-00536
)bogabag_00536-00568
)IIIF ingest
bogabag_0000
(990tif
files; diff local remote :white_check_mark: )bogabag_0001
(835tif
files; diff local remote :white_check_mark: )bogabag_0002
(999tif
files; diff local remote :white_check_mark: )bogabag_0003
(944tif
files; diff local remote :white_check_mark: )bogabag_0004
(984tif
files; diff local remote :white_check_mark: )bogabag_0005
(999tif
files; diff local remote :white_check_mark: )bogabag_0006
(994tif
files; diff local remote :white_check_mark: )bogabag_0007
(995tif
files; diff local remote :white_check_mark: )bogabag_0008
(994tif
files; diff local remote :white_check_mark: )bogabag_0009
(990tif
files; diff local remote :white_check_mark: )bogabag_0010
(996tif
files; diff local remote :white_check_mark: )bogabag_0011
(1000tif
files locally and remote :white_check_mark: )bogabag_0012
bogabag_0013
bogabag_0014
bogabag_0015
bogabag_0016
bogabag_0017
bogabag_0018
bogabag_0019
bogabag_0020
bogabag_0021
bogabag_0022
bogabag_0023
bogabag_0024
bogabag_0025
bogabag_0026
(995tif
files; diff local remote :white_check_mark: )bogabag_0027
(998tif
files; diff local remote :white_check_mark: )bogabag_0028
(998tif
files; diff local remote :white_check_mark: )bogabag_0029
(983tif
files; diff local remote :white_check_mark: )bogabag_0030
(998tif
files; diff local remote :white_check_mark: )bogabag_0031
(999tif
files; diff local remote :white_check_mark: )bogabag_0032
(994tif
files; diff local remote :white_check_mark: )bogabag_0033
(982tif
files; diff local remote :white_check_mark: )bogabag_0034
(1000tif
files locally and remote :white_check_mark: )bogabag_0035
(993tif
files; diff local remote :white_check_mark: )bogabag_0036
(1000tif
files locally and remote :white_check_mark: )bogabag_0037
(999tif
files; diff local remote :white_check_mark: )bogabag_0038
(984tif
files; diff local remote :white_check_mark: )bogabag_0039
(926tif
files; diff local remote :white_check_mark: )bogabag_0040
(1000tif
files locally and remote :white_check_mark: )bogabag_0041
(1000tif
files locally and remote :white_check_mark: )bogabag_0042
(995tif
files; diff local remote :white_check_mark: )bogabag_0043
(1000tif
files locally and remote :white_check_mark: )bogabag_0044
(999tif
files; diff local remote :white_check_mark: )bogabag_0045
(1000tif
files locally and remote :white_check_mark: )bogabag_0046
(997tif
files; diff local remote :white_check_mark: )bogabag_0047
(993tif
files; diff local remote :white_check_mark: )bogabag_0048
(995tif
files; diff local remote :white_check_mark: )bogabag_0049
(1000tif
files locally and remote :white_check_mark: )bogabag_0050
(999tif
files; diff local remote :white_check_mark: )bogabag_0051
(999tif
files; diff local remote :white_check_mark: )bogabag_0052
(999tif
files; diff local remote :white_check_mark: )bogabag_0053
(1000tif
files locally and remote :white_check_mark: )bogabag_0054
(1000tif
files locally and remote :white_check_mark: )bogabag_0055
(999tif
files; diff local remote :white_check_mark: )bogabag_0056
(874tif
files; diff local remote :white_check_mark: )