ContentMine / getpapers

Get metadata, fulltexts or fulltext URLs of papers matching a search query
MIT License
197 stars 37 forks source link

need a manifest file to preserve provenance info [important] #42

Open rossmounce opened 9 years ago

rossmounce commented 9 years ago

PMR and I strongly think we need a manifest.json of some sort to document in each search.

hence the suggested name of either 'manifest' or 'metadata' for this new JSON file.

If I do a search today, just by looking at the output I will have no idea in 7 days time what search I ran to get those results. PMR also thinks it's very important for downstream tools to have a manifest of all the files in the cmdir.

blahah commented 9 years ago

Yes this is a good idea. It's basically a log that we want, with a bit of structure that can be shared between getpapers, quickscrape, norma and ami. Simple to implement.