Closed vals closed 6 years ago
Just to record some context here in writing, this occurred when trying to use AEH to find expression patterns along a 2D diffusion map of a ~10,000 scRNA-seq dataset.
Notebook In the notebook I grab the embedding from an AnnData object. Here, I've just included a csv file with the columns X_diffmap1 and X_diffmap2
Data for reproducing the error
Let me know if there are any issues accessing everything!
Wow it's pretty slow with that many cells it turns out!
I can reproduce the problem with a smaller subsample of the data though, so I'll get to looking into that.
I think I fixed the problem in a stable way. I'm just rerunning your clustering now to make sure the results didn't change too much.
Hey @dpcook I reran this over the day, now there's no division by 0 error, and the delta_ELBO is properly decreasing.
You would want to run this until delta_ELBO becomes close to 0 though, which will probably take quite some time with this dataset!
I think the patterns I get after 3 iterations are roughly similar to yours though, so I'll push this change to PyPI.
Thanks for the nice report and data needed to debug this!
/Valentine
Thanks for this @vals! Sorry for delayed reply--just got back from vacation after SCB18.
I'll start running it today and will let you know if there are any issues!
Glad to report that after runner for a couple days, it seems to have converged after 99 iterations.
Wow! Thanks for checking.
Reported by David Cook