Hi,
I wondered why the deduplicated (gzipped) files are far larger than the original gzipped FASTQs (e.g. 14G vs. 34G in my most extreme example). After decompressing the files, it turned out that the deduplicated files are indeed smaller than the input files, so that part is as expected.
Our FASTQs are compressed with "max speed", i.e. the fastest and weakest gzip setting. Since the output is still larger than that, it seems super_deduper does not apply any compression when writing gzipped output?
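To illustrate the effect I suspect, here is a minimal sketch using Python's standard `gzip` module. It only shows how the gzip compression level affects size on FASTQ-like data; it says nothing about what super_deduper does internally. The helper `fastq_like` and the synthetic reads are made up for this illustration.

```python
import gzip
import random


def fastq_like(n_reads, read_len=100, seed=0):
    """Build synthetic FASTQ-like bytes (hypothetical test data, not real reads)."""
    rng = random.Random(seed)
    records = []
    for i in range(n_reads):
        seq = "".join(rng.choice("ACGT") for _ in range(read_len))
        qual = "".join(rng.choice("FGHI") for _ in range(read_len))
        records.append(f"@read{i}\n{seq}\n+\n{qual}\n")
    return "".join(records).encode("ascii")


data = fastq_like(2000)

# Level 0 stores the data without compressing it, level 1 is "max speed",
# level 9 is "best compression".
sizes = {
    level: len(gzip.compress(data, compresslevel=level))
    for level in (0, 1, 6, 9)
}

print(f"raw bytes: {len(data)}")
for level, size in sizes.items():
    print(f"gzip level {level}: {size} bytes")
```

With level 0 the result is a valid gzip file that is slightly larger than the raw input, which would match a "gzipped" output that is bigger than an input compressed with the fastest setting.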
compressed (input/output):
uncompressed (input/output):
Did I get this right? Just curious ;-)
best, Sven