pinellolab / dictys

Context specific and dynamic gene regulatory network reconstruction and analysis
GNU Affero General Public License v3.0
101 stars 13 forks source link

Mathematical description of indirect effects #61

Open ekernf01 opened 1 month ago

ekernf01 commented 1 month ago

Dear Dr. Wang,

Dictys is a beautiful and thorough piece of work -- thank you for your continued efforts. I don't have a software issue to report, just a docs request. What's the mathematical formula for the total effects returned by dictys network indirect, and what scale are those effects on, or what experiment would be needed to observe the effect? I am not sure but I have some guesses.

I have not studied stochastic processes much so I cannot tell at all whether this is equivalent to enforcing $X_j=0$ while simulating the differential equation model out to infinity.

Best regards. Eric Kernfeld

lingfeiwang commented 1 month ago

Hi Eric,

Thank you for taking the time into our theoretical derivations.

In this paper we are hoping to capture some of the kinetics that cannot be captured with traditional causal inference models, where stochastic process came into play. In this setting, we do not consider hard perturbations that simply set gene expression to specific values, like the 'do' operator. Such hard perturbations may be possible in social sciences like how the federal reserve sets the interest rate, but not really how things work in the physical world.

We considered three types of soft perturbations which we didn't elaborate in the paper.

I don't have the results at hand, but as far as I remember they are all equivalent and give the same result we used in the paper. We didn't consider complete KOs because they are almost identical with case 3 above because both give $xj\beta{ji}=0$, except having RNA level=0 which can lead to slight normalization differences.

Also, all the raw network parameters such as $\alpha$ and $B$ are already inferred before computing the total effect network. We don't need to find their values now. We are only considering infinitesimal perturbations on $\alpha$ (equivalent with other two cases above), which gave the theoretical result that $B^{(\infty)}=B^{-1}$. (Because the system is linear the effects can scale to finite perturbations.)

From here, there were several extra steps to consider for practical purposes:

We only used this final output as total effect network, which indicates the relative logFC of target gene expression caused by 1 relative logFC of regulator gene expression. Relative logFC is defined as the mean log expression level change divided by the standard deviation of the stochastic noise level in log expression.

I hope that answers your question and don't hesitate to follow up.

Best, Lingfei

ekernf01 commented 1 month ago

Thank you very much. This is extremely helpful. I do not fully understand the details yet, but I think I can proceed knowing just the last part of your response: relative FC in target per relative FC in regulator.

ekernf01 commented 1 month ago

A couple of follow up questions now that I have read more:

Thanks again!

lingfeiwang commented 1 month ago

Sure!

github-actions[bot] commented 6 days ago

Stale issue message