Closed frankligy closed 3 years ago
Hi Frank,
This reflects the previous version of the 10X Cellranger pipeline, which only allows counting one read (across all cells) from a particular position in the genome. It’s an aggressive filtering strategy to avoid PCR duplicates. In the latest version of Cellranger, 10X has relaxed this to allow reads that have unique (start, end, barcode) combinations. We may update our pipeline to reflect this change.
Thanks a lot for the reply!
Best, Frank
Hi,
Thanks a lot for the great tool and nice vignettes, which are super helpful!
I have a question regarding your RNA+ATAC tutorial (http://htmlpreview.github.io/?https://github.com/welch-lab/liger/blob/master/vignettes/Integrating_scRNA_and_scATAC_data.html). When calculating fragments counts overlapping with gene body and promoter region,
bedmap
was utilized and all cell barcodes that fall into gene or promoter region will be listed out in the output.However, I noticed that, the last column in the
fragments.tsv
file, which indicates how many duplicated reads in the cell that were observed for each fragment, seems to not be used in the counting process, meaning to say even if there are 10 reads that support the fragment, it will only be counted once, which was a bit confusing to me.Shouldn't we use both
echo-map-id
andecho-map-score
to also report the number reads associated with each fragment? (below is a excerpt from bedmap tutorial)Thanks, Frank