ropensci / datapack

An R package to handle data packages
https://docs.ropensci.org/datapack
44 stars 9 forks source link

Implement fetch.txt feature #59

Open nuest opened 8 years ago

nuest commented 8 years ago

The BagIt spec defines an optional file fetch.txt which includes external references to (potentially large) files that are not actually in the bag, but must be downloaded before validation (#58).

https://tools.ietf.org/html/draft-kunze-bagit-13#section-2.2.3

The contents to be stored to fetch.txt could be passed in a simple datastructure to serializeToBagit.

Just an idea, looking forward to feedback.

gothub commented 8 years ago

@nuest So when a DataPackage is serialized, the only action needed is to include the optional fetch.txt. When a package is validated, then then contents of fetch.txt are retrieved and checked against the manifest. Is this correct?

If this is the case, then this sounds like a great addition.

nuest commented 8 years ago

Yes, that's correct. I'll see if I can get to this some time later this year!