JonathanShor / DoubletDetection

Doublet detection in single-cell RNA-seq data.
https://doubletdetection.readthedocs.io/en/stable/
MIT License
81 stars 23 forks source link

Loading in raw matrix #143

Closed BridgetPK closed 2 years ago

BridgetPK commented 3 years ago

Hi, The current notebook tutorial shows using the h5 file rather than the matrix.mtx. However, the code on the readme shows that you can use the matrix by loading in as raw counts per the code below. When I do this, I get an error that states "ValueError: could not convert string to float: 'matrix.mtx'". Can you please provide info/code on how to load in the matrix? I am working with a dataset where I do not have the h5 file.

raw_counts is a cells by genes count matrix

labels = clf.fit(raw_counts).predict()

adamgayoso commented 3 years ago

what is the format that you have? Scanpy has a number of ways to load data in the anndata format, which could then be used exactly as in the tutorial.

https://scanpy.readthedocs.io/en/stable/api.html#reading

adamgayoso commented 2 years ago

Closing this issue due to inactivity.