malariagen / pipelines

Pipelines for processing malaria parasite and mosquito genome sequence data.
MIT License
14 stars 13 forks source link

Input source for mosquito short read alignment pipeline #32

Closed alimanfoo closed 2 years ago

alimanfoo commented 4 years ago

For the mosquito short read alignment pipeline, I believe there are two main options regarding the input data source for the sequence reads:

There are some practical trade-off in these choices:

alimanfoo commented 4 years ago

cc @kemprg @gbggrant @tnguyensanger @JonKeatley112 @hardingnj

gbggrant commented 4 years ago

If we use the bam/crams and convert them into fastqs for input to the short read alignment pipeline, we lose all of the tags at that step. Do we then want to add a MergeBamAlignment step after bwa to add those tags back to the aligned bam?

alimanfoo commented 2 years ago

Assuming this is resolved and can be closed.