chanzuckerberg / single-cell

A collection of documents that reflect various design decisions that have been made for the cellxgene project.
MIT License

Project-specific notebooks for CZI Hosted Projects (formerly Tier 2) and CZI Maintained Projects (formerly Tier 1) are embedded in the site #591

Closed pablo-gar closed 9 months ago

pablo-gar commented 11 months ago

Story: I am a computational biologist who wants to get Census integrated embeddings and their models so that I can perform downstream analysis in my platform and toolkit of choice, for example scanpy in Python.

Substory: I know that different integration approaches have different properties, and I would like to gain a high-level understanding of the analytical potential of each integration. I can follow analytical examples to ground my understanding. For example:

Product requirement: Project-specific notebooks for CZI Hosted Projects (formerly Tier 2) and CZI Maintained Projects (formerly Tier 1) are embedded in the site

High-priority demonstrations

  1. Visualizing a subset of cells in a 2D scatter map
  2. Analysis of the neighbors graph for interesting findings
  3. Projecting any cells (Census and non-Census) into the embedding space
  4. Inference workflow with the actual model (not the embeddings):
    1. e.g. Cell type
    2. e.g. in-silico experimentation

atarashansky commented 11 months ago

Some notes for me

Model candidates:
Tier 2: Jure's model, scGPT, Tim's NMF factors
Tier 1: SCVI, GeneFormer

GeneFormer and SCVI are the only ones with embeddings available at the moment. The data will need to be subset in the tutorial notebooks.

Prioritize Tier 1, then move on to Tier 2.

dsadgat commented 11 months ago

@atarashansky could you create tickets here please? Ty!

dsadgat commented 9 months ago

@atarashansky could you please close the completed tickets? And perhaps close this epic (as it's completed, correct?) TY!