marbl / CHM13

The complete sequence of a human genome
Other
883 stars 96 forks source link

Access to ONT fasta format reads? #4

Closed bitcometz closed 4 years ago

bitcometz commented 5 years ago

Hello, The fast5 for ONT data is so big. Is there any access to all the ONT reads or the long ONT reads (99 Gbp of data in reads >50 kbp, 32x) in the format of "fasta.gz". It would be much more convenient to download the data and usually doing assembly or SV detection will not need the quality information.

Thanks!

skoren commented 5 years ago

There is a fastq.gz file for the all the reads, it's 140g zipped so you could just extract the reads >50kb from there? Is that not sufficient for what you want to do?

bitcometz commented 4 years ago

Thanks ! Could you give the URL to download the 140g zipped file.

Best

skoren commented 4 years ago

It's in the readme under the rel2 section: https://github.com/nanopore-wgs-consortium/CHM13#rel2-genomic-dna. There are also instructions to download via AWS tools for faster transfer though wget will work on the posted links.