cernopendata / opendata.cern.ch

Source code for the CERN Open Data portal
http://opendata.cern.ch/
GNU General Public License v2.0

preservation: create and export BagIt packages #648

Open tiborsimko opened 9 years ago

tiborsimko commented 9 years ago

Create BagIt packages and make them downloadable via the portal. Nuance: small files can be included in the bags directly. For big files, only a reference to their actual storage location, plus checksums etc., will be stored in the bag. Hence the need to define "trusted third-party repositories" so that the archiver can run and produce proper bags.
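
A minimal sketch of the bag layout described above, using only the Python standard library and following the BagIt specification (RFC 8493): small files are copied under `data/`, large files are listed in `fetch.txt` with their URL and size, and every payload file gets a checksum line in the manifest. The function name `write_bag`, the `(local_path, storage_url)` input shape and the 1 MiB threshold are illustrative assumptions, not the portal's actual archiver.

```python
import hashlib
import shutil
from pathlib import Path

SMALL_FILE_LIMIT = 1024 * 1024  # assumed threshold (1 MiB), purely illustrative


def sha256_of(path):
    """Return the hex SHA-256 digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(65536), b""):
            digest.update(chunk)
    return digest.hexdigest()


def write_bag(bag_dir, files, limit=SMALL_FILE_LIMIT):
    """Write a bag; `files` is a list of (local_path, storage_url) pairs.

    Hypothetical helper: files up to `limit` bytes are copied into data/,
    larger ones are only referenced in fetch.txt.
    """
    bag = Path(bag_dir)
    (bag / "data").mkdir(parents=True, exist_ok=True)
    manifest, fetch = [], []
    for local_path, url in files:
        source = Path(local_path)
        payload = "data/" + source.name  # assumes unique file names
        size = source.stat().st_size
        # Every payload file is listed in the manifest, fetched or not.
        manifest.append(f"{sha256_of(source)}  {payload}\n")
        if size <= limit:
            shutil.copyfile(source, bag / payload)
        else:
            # fetch.txt line format: URL, length in bytes, path in the bag.
            fetch.append(f"{url} {size} {payload}\n")
    (bag / "bagit.txt").write_text(
        "BagIt-Version: 1.0\nTag-File-Character-Encoding: UTF-8\n",
        encoding="utf-8",
    )
    (bag / "manifest-sha256.txt").write_text("".join(manifest), encoding="utf-8")
    if fetch:
        (bag / "fetch.txt").write_text("".join(fetch), encoding="utf-8")
    return bag
```

A bag written this way is "holey" until the referenced files are fetched. The checksums in the manifest let a consumer verify the files after downloading them from the referenced storage.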

jirikuncar commented 9 years ago

Is it related to FFT vs. FFL? Or do we need more options for BagIt to decide when we should include the file?

tiborsimko commented 9 years ago

Yes, we'd need a new configuration option for the archiver in order to decide on this. Let's not depend on FFL, because we'll probably use the FFL technique for smaller files too...
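
One way such an option could look, sketched in Python: the decision depends only on the file size and on whether the file's storage host is a trusted repository, not on how the file was attached (FFT or FFL). The option names `BAGIT_MAX_EMBEDDED_SIZE` and `BAGIT_TRUSTED_HOSTS`, the example host and the function `bag_action` are all hypothetical. Rejecting large files on untrusted hosts is likewise an assumption about the desired behaviour.

```python
from urllib.parse import urlparse

# Hypothetical configuration values, not existing portal settings.
BAGIT_MAX_EMBEDDED_SIZE = 10 * 1024 * 1024  # bytes; larger files are referenced
BAGIT_TRUSTED_HOSTS = {"storage.example.org"}  # placeholder trusted repository


def bag_action(size, url,
               max_size=BAGIT_MAX_EMBEDDED_SIZE,
               trusted_hosts=BAGIT_TRUSTED_HOSTS):
    """Return "include" or "reference" for one file.

    Small files are always included in the bag. Large files are only
    referenced, and only when their storage host is trusted.
    """
    if size <= max_size:
        return "include"
    host = urlparse(url).hostname
    if host in trusted_hosts:
        return "reference"
    raise ValueError(f"large file on untrusted host: {url}")
```

With these defaults, `bag_action(1024, "https://elsewhere.example.com/a.txt")` returns `"include"`, while a file of several gigabytes at `https://storage.example.org/...` returns `"reference"`.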