scRNA-seq cell type annotation in the input requirement

digitalcytometry / ecotyper

EcoTyper is a machine learning framework for large-scale identification of cell states and cellular ecosystems from gene expression data.

I am interested in EcoTyper, and thank you for this great analysis method!

I have my own scRNA-seq data and am running EcoTyper following Tutorial 5. There are two required inputs: 1) an expression matrix and 2) an annotation matrix (e.g. scRNA_CRC_annotation.txt).

Could you explain the requirements for the annotation matrix? From the tutorial section, it seems that three columns are required: ID, CellType, and Sample.
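To make the question concrete, here is a minimal Python sketch of the format as I understand it from the tutorial. Only the three column names (ID, CellType, Sample) come from the tutorial; the tab delimiter, the checks for duplicate IDs and empty fields, the function name `check_annotation`, and the example rows are my own assumptions for illustration, not documented EcoTyper requirements.

```python
import csv
import io
from collections import Counter

# Column names as listed in the tutorial.
REQUIRED_COLUMNS = ("ID", "CellType", "Sample")


def check_annotation(text, delimiter="\t"):
    """Parse an annotation table and count cells per CellType label.

    Raises ValueError if a required column is missing, a required field
    is empty, or an ID appears more than once. The last two checks are
    assumptions, not rules taken from the EcoTyper documentation.
    """
    reader = csv.DictReader(io.StringIO(text), delimiter=delimiter)
    header = reader.fieldnames or []
    missing = [col for col in REQUIRED_COLUMNS if col not in header]
    if missing:
        raise ValueError(f"missing required columns: {missing}")

    seen_ids = set()
    counts = Counter()
    for row in reader:
        for col in REQUIRED_COLUMNS:
            if not (row[col] or "").strip():
                raise ValueError(f"empty {col} field in row: {row}")
        if row["ID"] in seen_ids:
            raise ValueError(f"duplicate ID: {row['ID']}")
        seen_ids.add(row["ID"])
        counts[row["CellType"]] += 1
    return counts


# Made-up rows for illustration only; not taken from scRNA_CRC_annotation.txt.
example = (
    "ID\tCellType\tSample\n"
    "cell_1\tB.cells\tpatient_A\n"
    "cell_2\tB.cells\tpatient_A\n"
    "cell_3\tFibroblasts\tpatient_B\n"
)
counts = check_annotation(example)
print(dict(counts))
```

In this reading, each row is one cell with exactly one CellType label and one Sample label.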

I would like to know what the requirements for the CellType column are. Should the cells be annotated with CIBERSORTx LM22? (I concluded this after a quick read of the paper, since the DLBCL bulk RNA is decomposed with CIBERSORTx LM22.) If so, what is the best practice for running scRNA-seq data in CIBERSORTx? In scRNA_CRC_annotation.txt, I found that only one cell type is given in the CellType column. Does this mean that the cell type with the highest frequency is picked?

Please let me know if I am completely on the wrong track.

Thank you again!

scRNA-seq cell type annotation in the input requirement #11