theislab / cellrank_notebooks

Tutorials and examples for CellRank.
https://cellrank.org
BSD 3-Clause "New" or "Revised" License
6 stars 5 forks source link

Update time series tutorial #44

Closed Marius1311 closed 1 year ago

Marius1311 commented 2 years ago

Currently, our tutorial subsamples the data to 25% of the cells to speed up computations - rather than doing that, we should use our adaptive thresholding scheme, which is currently not used. Would be good to see whether this changes results - it's okay if it does actually, subsampling to 1/4 of the data is quite a reduction in cell number.

WeilerP commented 2 years ago

The adaptive thresholding only helps when computing the Schur decomposition, macrostates, etc. I reran the tutorial on the entire dataset and this part is fast (<= 1 minute for each). The problem is computing wk.compute_transition_matrix which took over an hour to finish. The eigengap is now after the fourth eigenvalue (before it was after the first, so this is better)

image

but the same macrostates are inferred as before.

image

Marius1311 commented 2 years ago

Thanks a lot @WeilerP for running this, that's great! Good to see we get the same macrostates. You're right, the actual part that computes the transport maps does take a while, of course. Maybe we should pre-compute them and load them from file (I guess they can be cached in the kernel @michalk8 ? ) I think that would be the best solution actually, since we could showcase how to run CellRank on really large data.

michalk8 commented 1 year ago

closed via https://github.com/theislab/cellrank_notebooks/pull/64