kosaidtu / norgal

Under construction...
MIT License

Data too big: is there a need to subsample my data? #4

Open Silverfoxcome opened 5 years ago

Silverfoxcome commented 5 years ago

Hi!

I'm using norgal to assemble the mitogenome of purple maize (approx. 560,000 bp). My dataset is 37.6 GB for the two compressed fastq.gz files. Decompressed, they are approx. 300 GB each. Do you think I should subsample my files with a tool like seqtk, or just with head? How many reads do I need to recover the mitogenome? How long do you think the program will take to run on my data?

Thank you in advance :)
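
For anyone weighing the seqtk-versus-head question above, here is a minimal sketch of random subsampling for paired-end files, written in Python so the logic is visible. It is not part of norgal and says nothing about how many reads norgal needs. The function names, file names and the `n_pairs` value are all hypothetical placeholders. The idea is reservoir sampling: every read pair has the same chance of being kept, and mates stay together, whereas head only takes the first records in the file, which are not a random sample of the run.

```python
import gzip
import random
from itertools import islice


def read_fastq(handle):
    """Yield one FASTQ record at a time as a tuple of its 4 lines."""
    while True:
        record = tuple(islice(handle, 4))
        if len(record) < 4:  # end of file (or a truncated last record)
            return
        yield record


def subsample_pairs(r1_records, r2_records, n_pairs, seed=42):
    """Reservoir-sample n_pairs read pairs in one pass, keeping mates together.

    Assumes both inputs list the reads in the same order. zip() stops at the
    shorter input, so files of unequal length are silently truncated.
    """
    rng = random.Random(seed)
    reservoir = []
    for i, pair in enumerate(zip(r1_records, r2_records)):
        if i < n_pairs:
            reservoir.append(pair)      # fill the reservoir first
        else:
            j = rng.randint(0, i)       # inclusive on both ends
            if j < n_pairs:
                reservoir[j] = pair     # replace with probability n_pairs/(i+1)
    return reservoir


def subsample_files(r1_path, r2_path, out1_path, out2_path, n_pairs, seed=42):
    """Stream two gzipped FASTQ files and write a random subset of pairs."""
    with gzip.open(r1_path, "rt") as r1, gzip.open(r2_path, "rt") as r2:
        pairs = subsample_pairs(read_fastq(r1), read_fastq(r2), n_pairs, seed)
    with gzip.open(out1_path, "wt") as o1, gzip.open(out2_path, "wt") as o2:
        for rec1, rec2 in pairs:
            o1.writelines(rec1)
            o2.writelines(rec2)
    return len(pairs)
```

A call would look like `subsample_files("reads_1.fastq.gz", "reads_2.fastq.gz", "sub_1.fastq.gz", "sub_2.fastq.gz", n_pairs=1_000_000)`, where the paths and the pair count are made up for illustration. Only the sampled pairs are held in memory, so the full files never need to be decompressed to disk. seqtk does the same job from the command line: running `seqtk sample` on each file with the same `-s` seed keeps the pairs in sync.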

matryoskina commented 4 years ago

I would like to know the answer as well.