atarashansky / SAMap

SAMap: Mapping single-cell RNA sequencing datasets from evolutionarily distant organisms.
MIT License

input formats? #89

Closed: gulfshores closed this issue 1 year ago

gulfshores commented 2 years ago

Hi, I'm trying to compare a couple of species/datasets with SAMap. I've worked through a bunch of errors about missing parameters and formats in the input data. However, now I'm getting:

ValueError: Input contains NaN, infinity or a value too large for dtype('float64')

AttributeError: 'matrix' object has no attribute 'multiply'

I've checked the input matrices for these values and tried different matrix types. What are they supposed to be? How did you preprocess your datasets before SAMap (before what's in the notebook)?

Also, I haven't figured out where you are supposed to get them from?! Mainly adata.var["weights"] and sam1.adata.uns["run_args"]. Where do these weights come from?

Hope you have the time to help! Max (Quake group)

atarashansky commented 2 years ago

Hi Max,

Could you post the code snippet you're trying to run along with the error stack trace?

atarashansky commented 2 years ago

Also, not sure if this is documented, but SAMap requires the AnnData.X matrix to be in scipy sparse format. It looks like you might have dense matrices. Let me see if I can soften this requirement to unblock you... I'll post back when I have updated the code.
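As a stopgap, a dense matrix can be converted to scipy sparse format before it is handed to SAMap. The sketch below is only an illustration: `to_csr` is a hypothetical helper, not part of SAMap, and it uses plain NumPy/SciPy objects rather than a real AnnData. It also shows why the `'matrix' object has no attribute 'multiply'` error is consistent with dense input: `np.matrix` has no `.multiply` method, while scipy sparse matrices do.

```python
import numpy as np
import scipy.sparse as sp


def to_csr(x):
    """Hypothetical helper: return x as a scipy CSR matrix."""
    if sp.issparse(x):
        # Already sparse: just normalize the storage format to CSR.
        return x.tocsr()
    # np.asarray unwraps np.matrix into a plain ndarray before conversion.
    return sp.csr_matrix(np.asarray(x))


# A dense np.matrix, the kind of object the AttributeError complains about.
dense = np.matrix([[0.0, 1.0], [2.0, 0.0]])
dense_has_multiply = hasattr(dense, "multiply")

sparse = to_csr(dense)
# Element-wise product, available on scipy sparse matrices.
squared = sparse.multiply(sparse)

print(dense_has_multiply, sp.issparse(sparse))
print(squared.toarray())
```

With an AnnData object, the equivalent step would presumably be `adata.X = to_csr(adata.X)` before running SAM and SAMap, assuming nothing else in the pipeline expects a dense array.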

Mainly adata.var["weights"] and sam1.adata.uns["run_args"]. Where do these weights come from?

These numbers come from running the SAM algorithm on the data.

EDIT:

[image]

I added a note to the above section in the tutorial notebook that you should have your expressions in sparse format.

atarashansky commented 2 years ago

Not sure about your error yet since I haven't seen the code or stacktrace, but try installing samap v1.0.7 (pip install samap==1.0.7) and see if that fixes your issue.
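In the meantime, the `ValueError` can be narrowed down by counting non-finite entries in each input matrix. This is a minimal sketch under assumptions: `count_nonfinite` is a hypothetical helper, not a SAMap function, and the two small matrices stand in for real expression data.

```python
import numpy as np
import scipy.sparse as sp


def count_nonfinite(x):
    """Hypothetical helper: count NaN/inf entries in a dense or sparse matrix."""
    if sp.issparse(x):
        # Only explicitly stored values can be non-finite;
        # implicit zeros are finite by definition.
        values = x.tocsr().data
    else:
        values = np.asarray(x)
    return int((~np.isfinite(values)).sum())


# Stand-in data: one clean sparse matrix, one dense matrix with bad values.
good = sp.csr_matrix(np.array([[0.0, 1.0], [2.0, 0.0]]))
bad = np.array([[0.0, np.nan], [np.inf, 3.0]])

n_good = count_nonfinite(good)
n_bad = count_nonfinite(bad)
print(n_good, n_bad)
```

A nonzero count on an input matrix would point at the data itself rather than at the matrix type.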