LibraryOfCongress / bagger

The Bagger application packages data files according to the BagIt specification.
Other
120 stars 19 forks source link

Bags created and saved/serialized as .zip need to preserve original creation date #16

Open houzanme1 opened 8 years ago

houzanme1 commented 8 years ago

When a new bag is created and saved to a new location as zip files, the record in the data folder are all changed to the current date inside the zip archive. Due to the requirement for data integrity, any data in that folder should keep the original created date.

johnscancella commented 8 years ago

@houzanme1 Can you please provide a step by step example to reproduce the error? Could you also include an example of the expected behavior?

Thanks

houzanme1 commented 8 years ago

Sorry I just updated the issue. If you create a zip bag, the issue is reproduced. When I tested the standard bag with no zip, there is no issue there. Expected behavior : inside the zip bag, the "data" folder content would have their original creation date either 1) when opened as archive, or 2) when extracted.

johnscancella commented 8 years ago

I am curious, how often do you save as a zip from bagger?

houzanme1 commented 8 years ago

Frankly, it is sporadic and depends 1) whether I am selecting a few files or 2) I have imaged a floppy disk, or 3) when it is a website (we can think of social media too) which come with many files.. In the second case, I like to keep the image and its extraction together as one. In the third case, a wrapper definitely helps.

But as it is working fine with selected files (without) zipping the output, I am fine with it at the moment.

I do want to point to this though, that if one wants to encapsulate the "record" with its metadata for preservation --the encapsulated AIP--, it will be helpful to have the original, the migrated copy, the access copy and the metadata structured correctly before zipping together. Candidate formats are of course zip, but also warc or sirf.

We can certainly talk about those options, later when you have time. As of now, keeping metadata order trumps all of those needs. Tibaut

Best Regards,

Tibaut Houzanme

Mobile: +1-317-332-3296 email: houzanme@gmail.com

On Thu, Feb 18, 2016 at 2:48 PM, John Scancella notifications@github.com wrote:

I am curious, how often do you save as a zip from bagger?

— Reply to this email directly or view it on GitHub https://github.com/LibraryOfCongress/bagger/issues/16#issuecomment-185887658 .