dahak-metagenomics / dahak

benchmarking and containerization of tools for analysis of complex non-clinical metagenomes.
https://dahak-metagenomics.github.io/dahak
BSD 3-Clause "New" or "Revised" License
21 stars 4 forks source link

Storing big files (issue-collecting issue) #53

Open charlesreid1 opened 6 years ago

charlesreid1 commented 6 years ago

There are currently several open issues about large files and a few ideas, so I thought I would create a single issue to collect references to these issues, summarize possible solutions, and provide links to some resources.

Open issues

(Any objections to closing these issues?)

Solutions

A few proposed solutions:

Topics for Discussion

Can we pin numbers on our requirements to get a sense of how much this might cost? (If the budget is zero, that's useful to know too!)

There are some cloud workflows for avoiding large downloads as well, depending on the constraints and where we want to dedicate time. These would definitely be useful in the context of testing.