Nanostring-Biostats / InSituType

An R package for performing cell typing in SMI and other single cell data
Other
29 stars 11 forks source link

findAnchorCells speed improvements #83

Closed patrickjdanaher closed 2 years ago

patrickjdanaher commented 2 years ago

(The goal is to avoid calculating logliks for all cells * profiles, which is slow.)

Plan:

patrickjdanaher commented 2 years ago

Exact alg:

  1. calc cosines
  2. ID rows with max > thresh
  3. For each cell type:
    • get subset of rows flagged in (2) AND with the cell type's cosine > 0.5*thresh
    • get lldist over that subset
patrickjdanaher commented 2 years ago

done.