CGATOxford / UMI-tools

Tools for handling Unique Molecular Identifiers in NGS data sets
MIT License
491 stars 190 forks source link

all zero expression genes #545

Closed wangjiawen2013 closed 2 years ago

wangjiawen2013 commented 2 years ago

Hi, The count table is shroter than my gtf gene list sometimes, I doubt that umi-tools will discard genes that don't express in any cells. How to keep the all zero expression genes ? If so, the count table among different experiments can be concatenated easily, especially when there are thoudsands of count tables generated over a long time period.

TomSmithCGAT commented 2 years ago

Hi.

umi_tools count only has access the the aligned reads in the BAM, not the GTF/GFF, so if no reads are aligned to a gene in any cell, that gene will not be in the count table.

If you need to merge the counts table from multiple experiments, you can use an outer join. E.g