UIUCLibrary / Speedwagon

Collection of tools and workflows for DS
Other
6 stars 4 forks source link

Speedwagon - Throw Error for missed page numbers #519

Open vbjohna opened 6 months ago

vbjohna commented 6 months ago

I had a package of files rejected from HT because a page number was skipped. Could Speedwagon throw an error if a page number is skipped or missing?

henryborchers commented 6 months ago

@vbjohna Could you give me an example here of something that would pass and something that would fail such a test?

vbjohna commented 6 months ago

Here is the email that I received from HT. Warin, Martin commented:

Hello digitizationservices, These three volumes were ingested and will be discoverable in HathiTrust within 48 hours:

99270486612205899 99522697412205899 99534121812205899 One volume has missing pages or issues with sequence numbers, and was rejected:

uiuc.99526683012205899: simple; punted at 2024-02-16 10:41:59
Missing file; Skip sequence number from 00000003 to 00000005; stage: HTFeed::VolumeValidator; Please let me know if you need assistance troubleshooting.

Best, Martin Warin, HathiTrust developer

vbjohna commented 6 months ago

Angela asked me to add: To have SW throw an error if the amount of files from the preservation folder and the access folder do not match. As for the error message, as much info as possible would be most helpful. Page count does not match for MMSID. The message from HT was super helpful. Missing file; Skip sequence number from 00000003 to 00000005 from MMSID ...