We have more than 200,000 pcdm:Files (most are images probably but I can't confirm this) in our collection with multiple and unequal ebucore:dateCreated values.
It's easy to see how this is happening when comparing the fits output:
Line 23, and why I have quoted a block, presents a solution, used only for lastmodified (which is oddly not part of base_schema.rb, but I digress).
Would it make sense to similarly restrict created to only the ExifTool output? (That seems preferable seeing as NLNZ's 'created' date appears to be the modification date, assuming we can trust ExifTool and Tika for that information.)
If that is potentially too restrictive, is there a way to select a single creation date in a more sophisticated manner (if Exiftool, elseif Tika, else if...)?
We have more than 200,000 pcdm:Files (most are images probably but I can't confirm this) in our collection with multiple and unequal
ebucore:dateCreated
values.It's easy to see how this is happening when comparing the
fits
output:to the code that parses the fits document, line 26:
https://github.com/samvera/hydra-works/blob/d41be82540eca9d3a76301df3dcea5b74c9d5937/lib/hydra/works/characterization/fits_document.rb#L23-L26
Line 23, and why I have quoted a block, presents a solution, used only for
lastmodified
(which is oddly not part of base_schema.rb, but I digress).Would it make sense to similarly restrict
created
to only the ExifTool output? (That seems preferable seeing as NLNZ's 'created' date appears to be the modification date, assuming we can trust ExifTool and Tika for that information.)If that is potentially too restrictive, is there a way to select a single creation date in a more sophisticated manner (if Exiftool, elseif Tika, else if...)?