Closed MikeDMorgan closed 5 years ago
As an update - this sparsity issue seems to be driven by include a filter gene list - using all genes circumvents the problem. However, I would expect that this makes the reduced dimensional representation much noisier.
NB: Selecting a subset of 100 random genes also works - I assume this is due to the selection of HVGs that are lowly expressed, which exacerbates the sparsity problem. I'll consider this issue resolved for now.
You can also try changing the solver using the command line arguments:
--solver fixed_iters
Thanks Joshua - I have tried this, but there is still a division by zero
, which suggested a sparsity issue; I wasn't confident that even running with very many iterations would get close to convergence.
Hi Mike,
I think this might be due to a small value for the entropy parameter epsilon. Can you try with a bigger value for epsilon?
Best, Geoff
On Wed, Nov 13, 2019 at 5:57 AM MikeDMorgan notifications@github.com wrote:
Thanks Joshua - I have tried this, but there is still a division by zero, which suggested a sparsity issue; I wasn't confident that even running with very many iterations would get close to convergence.
— You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHub https://github.com/broadinstitute/wot/issues/73?email_source=notifications&email_token=ACJCQYU7OM5UUQBBDCNBR6DQTQBUBA5CNFSM4JMYJQYKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOED6HCRA#issuecomment-553414980, or unsubscribe https://github.com/notifications/unsubscribe-auth/ACJCQYSOKSASLPBWWRWGXY3QTQBUBANCNFSM4JMYJQYA .
Hi Geoff, I tried a range of epsilon values from the default to ~1 with no appreciable effect other than it took several more iterations for the cost matrix to fill with NAs
. I suspect a very large epsilon would be required, on the order >> 10 to completely fix this, but then I'm not sure what the consequences would be for interpreting the transport map and downstream analyses. If you have any advice in that respect it would be greatly appreciated.
What values are you using for lambda 1 and 2?
As epsilon goes to infinity, you would get the independent coupling. This would be uninformative — any cell could go to any cell.
Just for the purpose of debugging, it might be nice to see if the code runs for epsilon 1,2,3,...10.
How many cells do you have in both time points?
On Fri, Nov 15, 2019 at 12:26 AM MikeDMorgan notifications@github.com wrote:
Hi Geoff, I tried a range of epsilon values from the default to ~1 with no appreciable effect other than it took several more iterations for the cost matrix to fill with NAs. I suspect a very large epsilon would be required, on the order >> 10 to completely fix this, but then I'm not sure what the consequences would be for interpreting the transport map and downstream analyses. If you have any advice in that respect it would be greatly appreciated.
— You are receiving this because you commented.
Reply to this email directly, view it on GitHub https://github.com/broadinstitute/wot/issues/73?email_source=notifications&email_token=ACJCQYQO7HWJJWBFK3LDL5TQTZMMJA5CNFSM4JMYJQYKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOEEEV6OQ#issuecomment-554262330, or unsubscribe https://github.com/notifications/unsubscribe-auth/ACJCQYVU2PKGSGDKEMEZ56TQTZMMJANCNFSM4JMYJQYA .
Hi, I'm attempting to debug an error in the optimal transport construction step, run on the command line.
The traceback is:
This error arises when computing the transport map between multiple different pairs of time points, so it is not specific to these two.
As far as I can tell the issue arises during the update of
a
andb
variables, because the matrixK
contains many small values very close to 0, which subsequently becomeInf
upon log transformation.Any thoughts on a possible solution?