Open cb4github opened 4 years ago
Hi @cb4github
STAR will not work well for mapping reads to distant species genomes... The maximum divergence it can deal with is ~3-5%. You can try to reduce --seedSearchStartLmax to 10, and increase --winAnchorMultimapNmax to 200 (or 500), but I doubt it will improve the results sufficiently.
Cheers Alex
Dear Alex,
Briefly (see X.Log.final.out for 2 reference genomes X below), I've gotten unmapped percentages of 99.83% and 99.27% in mapping ~200M reads (for 8 samples of 2 species) of shark to arguably distant reference genomes, coelacanth and elephant shark, respectively.
Can you suggest best practice for choosing mapping parameters to increase the mapped reads count as such while somehow controlling overmatching?
Please let me know if you need more info, thanks.
Best, CB