greenelab / tybalt

Training and evaluating a variational autoencoder for pan-cancer gene expression data
BSD 3-Clause "New" or "Revised" License
162 stars 62 forks source link

Saving the population of models for each ensemble across all z dimensions #128

Closed gwaybio closed 6 years ago

gwaybio commented 6 years ago

Previously (#117), through a sweep of different z dimensions, I saved a single compressed matrix for each z dimension even though I was training multiple. In #117 I also calculated stability measurements (correlation matrix determinants) - however, I was getting underflow as the dimensionality increased.

A solution is to save the population of models that are being trained. Saving the ensemble of models for each z will enable downstream stability estimates and also potentially improve biological discovery in Tybalt and ADAGE models. For convenience, I will call these models ezADAGE and ezTybalt :joy_cat:

I also update the parameter file (as a result of ADAGE tied weights in #123 ) (also related to #127 )