nf-core / methylseq

Methylation (Bisulfite-Sequencing) analysis pipeline using Bismark or bwa-meth + MethylDackel
https://nf-co.re/methylseq
MIT License
137 stars 133 forks source link

Specify additional "genomes" of spike-in controls for downstream calculation of bisulfite conversion efficiency #394

Open pdemko opened 4 months ago

pdemko commented 4 months ago

Description of feature

Library preparation kits come with unmethylated and methylated DNA controls that are added to samples (e.g. NEBNext EM-seq). It would be useful to supply methylseq with the fasta sequences of the controls and have it perform additional alignment, deduplication, and extraction using those references. This would help enable calculation of bisulfite/enzymatic conversion efficiency.

Thank you for providing this excellent pipeline!

cjfields commented 3 months ago

At the moment, we're basically including them as part of the main reference genome FASTA and analyzing outside of methylseq. Having a way to include them separately would also be useful so additional spike-in specific analyses could be performed.