short reads for correction

Hi @PetrNguyen,

I apologize for the delay, I missed the notification of your issue. Adapters should definitely be trimmed from the short reads before using them for correction. Filtering is up to you, it depends on what kind of filtering you want to do but I assume something rather minimal would do. When it comes to using short reads from individual A to correct the long reads from individual B, I would advise against using Ratatosk, even for mixed-haplotype assembly purposes. A lot of k-mers overlapping variants of individual B won't be found in the short reads of A, hence limiting the anchoring of long reads on the graph built from the short reads. The SNP candidate detection embedded within Ratatosk will be useless and will probably confuse the correction. Colors representing short reads mapping to the graph will guide the correction towards incorrect paths. And those are only a couple of issues coming on top of my head, there are many other issues with using short/long reads from different individual in Ratatosk.

Guillaume

DecodeGenetics / Ratatosk

short reads for correction #24