theislab / multimil

Multimodal weakly supervised learning to identify disease-specific changes in single-cell atlases
https://multimil.rtfd.io/
BSD 3-Clause "New" or "Revised" License
19 stars 3 forks source link

Compare architectures #38

Closed alitinet closed 3 years ago

alitinet commented 3 years ago

Compare architectures on the following example: 2 datasets, one paired CITE-seq (pair1), one just RNA (pair2). RNA is mod1, protein is mod2. Output of the shared encoder for pair1 is z_1, output for pair2 is z_2. Modality vectors are denoted by v_1, v_2.

alitinet commented 3 years ago

Old model:

Notes:

alitinet commented 3 years ago

New model 1:

alitinet commented 3 years ago

New model 2:

Notes:

alitinet commented 3 years ago

Hey @M0hammadL, here's some results with the new models, I am not exactly sure why in case of the first (old model) and the third model (new model 2) modality vectors converge to v2 = 1/2 v1, but I think it's because of the way MMD is calculated in the void space.

In new model 1, so when the assumption is that z2 incorporates both modalities even though it's coming from just rna, I don't like that z2 is fed directly into mod_dec1. But if we feed e.g. z2 - v2 into mod_dec1, then the situation becomes weird when we have 3 modalities, e.g. CITE and ATAC. Then we'd have to assume that data coming from CITE-seq has actually all 3 modalities, and then subtract e.g. v_atac from z1, which I think doesn't make much sense.

alitinet commented 3 years ago

Decided to go for a PoE architecture with vector arithmetic (#39).