KangaroosInAntarcitica / mixes

A mixture models package including GMM, Skew GMM, GMN and DGMM
Other
5 stars 0 forks source link
mixture-models python

Mixes - repository of mixture models

The repository is available as package in PyPI: pypi.org/project/mixes.

This repository was created as part of Research paper "Estimation of Gaussian Mixture Networks" required as part of Master of Mathematics degree in Computational Mathematics at the University of Waterloo authored by Andriy Dmytruk and supervised by Ryan Browne.

The repository includes implementation of the following mixture models:

Usage

You can install the package mixes from PyPI: pip install mixes and use it directly, e.g.: from mixes import Evaluator, GMM, GMN.

The implementation is present in the mixes/ folder. You can see an example of usage in the experiments/example.ipynb jupyter notebook.

All the experiments that were performed as part of the research paper can also be found inside the experiments/ folder.

Kindly provide a citation to the research paper if this work proves useful to you:

@thesis{GMN,
    title        = {Estimation of Gaussian Mixture Networks},
    author       = {Andriy Dmytruk},
    year         = 2022,
    type         = {mathesis},
    institution  = {University of Waterloo}
}

Models description

Skew Gaussian Mixture Model

Skew GMM was implemented based on paper "Maximum likelihood estimation for multivariate skew normal mixture models" by Tsung I. Lin (2006).

Gaussian Mixture Network

GMN was proposed in the author's research paper. The model creates a network of gaussian distributions where next layers in the model have conditional probability distribution based on the previous layer. Each layer is a mixture of components, therefore the whole model creates a network of gaussian nodes.

The most important parameters are:

The paper can be found on Research Gate or the University of Waterloo website.

Deep Gaussian Mixture Model

DGMM is based on papers "Deep Gaussian mixture models" by Cinzia Viroli, Geoffrey J. McLachlan (2019) and "Factoring Variations in Natural Images with Deep Gaussian Mixture Models" by Aaron van den Oord, Benjamin Schrauwen (2014).

The parameters are similar to GMN model, as is the implementation in this repository.

The difference between DGMM and GMN is that GMN gives probabilities to layer's components conditional on the previous layer, while DGMM has them independent.

Annealing

We implemented deterministic annealing for mixture models as described in the paper "On the Bumpy Road to the Dominant Mode" by Hua Zhou, Kenneth L. Lange (2010).

Since the log-likelihood functions is frequently non-concave, the EM algorithm can end up in suboptimal modes. The idea of annealing is to flatten the objective function and therefore increase the chances of terminating in a dominant mode.

The parameter use_annealing determines whether to use annealing, while the parameter annealing_start_v determines the intial value for annealing. The value must be between 0 and 1. Lower values correspond to a more flattened objective function, while 1 corresponds to no annealing. Starting for the annealing_start_v, the annealing value will be increased to 1 during model fitting if use_annealing is set to true.

Regularization

GMM, GMN and DGMM models have the variance regularization parameter var_regularization. Regularization makes the covariances larger on each step. This keeps the covariance matrix from becoming close to singular, which would greatly degrade optimization for it. The parameter can also be used for restricting the model to larger covariances and avoid overfitting.

Stopping criterion

Use the stopping_criterion parameter in models to specify a stopping criterion. Specified function must have the same signature as functions in the mixes/stopping_criterion.py file.