marcus1487 / nanoraw

Genome-guided re-segmentation and visualization for raw nanopore sequencing data.
https://pypi.python.org/pypi/nanoraw
Other
46 stars 9 forks

test data #42

Open osilander opened 7 years ago

osilander commented 7 years ago

Any possibility of posting a small test data set, e.g. from E. coli? Cheers.

marcus1487 commented 7 years ago

I have been reluctant to add a test data set to the code repository because the FAST5 files are so large relative to the code base. The small data set I use to test nanoraw is ~2GB, while the whole code repository is only 450KB, so adding these files would bloat the download too much. The two other options I can think of are to 1) host the test data sets in another location or 2) construct a very small synthetic data set. I would opt for option 1, as I am not sure how useful a synthetic data set would be anyway. I will try to get this first option up this week.

osilander commented 7 years ago

Thanks, that would be fantastic. Cheers.
