gmarcais / Jellyfish

A fast multi-threaded k-mer counter

Difference between reading concatenated PE reads and merging paired end .jf outputs independently #204

Open JWrighty97 opened 5 months ago

JWrighty97 commented 5 months ago

I have been finding my way through k-mer counting of some Illumina paired-end read data for my PhD (10 files, 5 samples). After running the data through Jellyfish in two different ways, I have found massive differences in the results of downstream analysis, i.e. GenomeScope.

Attached are the images of the downstream analysis for the two approaches: concatenating a library (four files) before running Jellyfish, and counting each file independently and then using jellyfish merge to combine the resulting .jf outputs.
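To make the comparison concrete, here is a toy sketch in Python of the two workflows. It is not Jellyfish's implementation: `count_kmers` and the reads are hypothetical, and it assumes that merging simply sums the per-file count of each k-mer, with identical settings and no count filtering in every run.

```python
from collections import Counter


def count_kmers(reads, k):
    """Count every k-mer (forward strand only) in an iterable of read sequences."""
    counts = Counter()
    for read in reads:
        for i in range(len(read) - k + 1):
            counts[read[i:i + k]] += 1
    return counts


# Hypothetical toy reads standing in for the R1 and R2 files of one library.
reads_r1 = ["ACGTACGT", "TTGACGTA"]
reads_r2 = ["CGTACGTT", "ACGTTGAC"]
k = 4

# Workflow 1: concatenate the inputs, then count once.
concatenated = count_kmers(reads_r1 + reads_r2, k)

# Workflow 2: count each input separately, then merge by summing the counts.
# (Assumption: this mirrors what merging per-file counts is meant to do.)
merged = count_kmers(reads_r1, k) + count_kmers(reads_r2, k)

same = concatenated == merged
print("Both workflows agree:", same)
```

Under those assumptions the two tables are identical, so I would have expected the two approaches to agree, which is why the difference in the plots surprises me.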

Is this how you would recommend using your software?

Best wishes,

[Attached images: transformed_linear_plot, transformed_linear_plot]