biolab / orange3-single-cell

šŸŠšŸ”¬ Orange add-on for gene expression of single cell data
https://singlecell.biolab.si/
Other
17 stars 25 forks source link

Single-cell Preprocess: Add TF-IDF #339

Open mstrazar opened 5 years ago

mstrazar commented 5 years ago

An alternative to log(CPM+1) transformation of count data is the TF-IDF transform, adopted from text analysis. Similar to finding characteristic words describing a topic in the document, TF-IDF can be used to find stand-out genes ("terms") for each cell ("document").

It should be relatively straightforward to include this approach into Single-cell preprocess.

See https://bmcgenomics.biomedcentral.com/articles/10.1186/s12864-018-4922-4

gabriellayi commented 4 years ago

Hi,

I read your papers about this method, however Iā€™m new to coding, do you mind sharing the code or telling me where I can find code or tutorial for this method which id like to apply for scRNA-seq Gene clustering?

Thanks!! Yi