frisen-lab / TREX

Simultaneous lineage TRacking and EXpression profiling of single cells using RNA-seq
MIT License

Tune low-complexity filtering #43

Open marcelm opened 1 year ago

marcelm commented 1 year ago

Low-complexity filtering currently discards molecules for which the entropy of the full cloneID is below 1.0. Two of the decisions behind this could be made differently and might lead to better filtering results:

  1. Use a different threshold
  2. Change what is done with deleted/missing bases (0 and -). As suggested by @acorbat, we could compute entropy on the cloneID without them and rescale entropy.
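To make the second option concrete, here is a minimal sketch. It is not TREX's actual code: the function names are hypothetical, and it assumes the entropy is plain Shannon entropy in bits over character frequencies. "Rescale" is interpreted here as dividing by the maximum entropy achievable with the remaining bases, which is only one possible reading of the suggestion.

```python
import math
from collections import Counter

MISSING = "0-"  # characters for deleted/missing bases, as named in this issue


def shannon_entropy(sequence: str) -> float:
    """Shannon entropy in bits of the character frequencies in sequence."""
    if not sequence:
        return 0.0
    n = len(sequence)
    return -sum((c / n) * math.log2(c / n) for c in Counter(sequence).values())


def entropy_ignoring_missing(clone_id: str, missing: str = MISSING) -> float:
    """Entropy computed only on the bases that are actually present."""
    present = "".join(ch for ch in clone_id if ch not in missing)
    return shannon_entropy(present)


def normalized_entropy(clone_id: str, missing: str = MISSING) -> float:
    """Entropy of the present bases, rescaled to the range 0..1.

    Assumption: the scale factor is the largest entropy reachable with the
    remaining bases, log2(min(4, number of present bases)).
    """
    present = "".join(ch for ch in clone_id if ch not in missing)
    max_entropy = math.log2(min(4, len(present))) if len(present) > 1 else 0.0
    if max_entropy == 0.0:
        return 0.0
    return shannon_entropy(present) / max_entropy


if __name__ == "__main__":
    for clone_id in ["AAAAAAAA", "ACGTACGT", "AC--00--", "AAAA----"]:
        print(
            clone_id,
            round(shannon_entropy(clone_id), 3),
            round(entropy_ignoring_missing(clone_id), 3),
            round(normalized_entropy(clone_id), 3),
        )
```

Note that a cloneID such as `AAAA----` has a full-string entropy of exactly 1.0 because `-` counts as a symbol, so the current threshold would not discard it, while its entropy without the missing bases is 0. If a rescaled value were used, the threshold (option 1) would need to be chosen again on the new scale.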

See https://github.com/frisen-lab/TREX/issues/41#issuecomment-1757614961