Closed siboehm closed 2 years ago
Currently running this
Trained 5 new GROVER models on LINCS, with large encoders (up to depth 7) and large drug embedders and dosers (depth ~4). Latent size was 64, I adjusted the adversarial parameters and only a single model was easy to disentangle (which is good).
There weren't any surprisingly good results, so I think we're in the right ballpark regarding sizes.
In this plot I selected the top 2 models that had a reasonable disentanglement loss (<0.1), and the lowest val score. Batch_id: 45 (large models), 39 (normal models).
@MxMstrmn Closing this, unless you have objections
The Vanilla Model with ~6M parameters is outperforming most of our other model, however these other models have maximum ~2M parameters.
It might be worth scheduling some runs with very large models, to see how they perform.