NCI-RBL / iCLIP

RNA Biology Pipeline to Characterize protein-RNA Interactions
https://rbl-nci.github.io/iCLIP/
MIT License
4 stars 2 forks source link

remove unique/multimap split before deduplication #16

Closed slsevilla closed 3 years ago

slsevilla commented 3 years ago

split into unique and multimap files before deduplication is not necessary for splice_aware branch of pipeline (NH tag is added already). update workflow.

current n_split.sam > unique.n_split.bam, mm.n_split.bam unique.n_split.bam, mm.n_split.bam > index and sort merge all unique.n_split bam files merge all mm.n_split bam files unique.all.bam, mm.all.bam > index and sort merge unique.all.bam and mm.all.bam merged.bam > index and sort dedup

updated n_split.sam > n_split.bam n_split.bam > index and sort merge n_split.bam merged.bam > index and sort dedup

slsevilla commented 3 years ago

resolved with commit e4ab6eda93a0356761aee2cf4e0f374f4cb5d7d8