openpreserve / jhove

File validation and characterisation.
http://jhove.openpreservation.org
Other
171 stars 79 forks source link

Jhove errors - migrating a large number of historical files #916

Open JuliaWahlund opened 8 months ago

JuliaWahlund commented 8 months ago

Hello!

My name is Julia Wahlund and I work as a product manager at the National Library of Sweden. We are currently in a migration project where we are migrating a lot of digital files to our preservation platform. While we are doing this, we are validating our files using JHOVE. We have encountered some issues previously that we have fixed, but now we have more issues with files not being able to validate.

We have divided it up into files that can be opened/rendered and files that cannot be opened.

Errors for files that CAN be opened are:

  1. 2311331 JHOVE_ERR: "Not well-formed" : "File is too short"
  2. 974933 JHOVEERR: "Not well-formed" : "No TIFF header: ¢"
  3. 974937 JHOVE_ERR: "Not well-formed" : "Premature EOF"
  4. 2311324 JHOVE_ERR: "Not well-formed" : "No TIFF header: r"

Errors for files that CANNOT be opened are:

  1. 2278540 JHOVE_ERR: "Not well-formed" : "No TIFF header: 8B"
  2. 2023591 JHOVE_ERR: "Not well-formed" : "Type mismatch for tag 36864; expecting 7, saw 2"
  3. 2311329 JHOVE_ERR: "Not well-formed" : "Unknown TIFF IFD tag: 34152"
  4. 974935 JHOVE_ERR: "Not well-formed" : "Unknown data type"

Do you have any suggestions on how to fix these files? Last time we had an error for not well formed tiffs, we resaved them after talking to some experts within digital preservation area and then they validated. And then we saved some preservation metadata for this action. But these errors are hard to find information around, so we are not so sure what we can do about it.

Looking forward to hearing from you,

Best regards,

Julia Wahlund

JuliaWahlund commented 8 months ago

My bad. It is the other way around on the errors. So first title and section should be "cannot be opened" and the second title and section should be "can be opened"

GeorgiaMoppett commented 8 months ago

Hi Julia,

Thank you for your logged issue! We're having a look at this now, and will follow up soon.