quick question about filtering of genes

HelenaLC / muscat

Multi-sample multi-group scRNA-seq analysis tools

160 stars 32 forks source link

Huh, I totally get why this is confusing / seems arbitrary. There is really not "magic" here, rather, this is a hacky workaround for an issue I encountered during development:

Aggregation might use different summary statistics (say, sum or mean or median) and different assay data (say, counts or expression-like values). Meanwhile, edgeR's filterByExpr() is designed for count-like data... So the > 100 check is hoping to check "Do these look like counts?" (Well, sum of single-cell counts, really) Before having this in place, filterByExpr() would remove everything when aggregateData() had been called with, for example, mean of logcounts... Hope that makes sense!

HelenaLC / muscat

quick question about filtering of genes #100