FloList / EMPL

The Earth Mover's Pinball Loss: Quantiles for Histogram-Valued Regression
6 stars 0 forks source link

The Earth Mover's Pinball Loss:
Quantiles for Histogram-Valued Regression

This is the Tensorflow implementation of the paper The Earth Mover's Pinball Loss: Quantiles for Histogram-Valued Regression (ICML 2021, [http://proceedings.mlr.press/v139/list21a/list21a.pdf]). The Earth Mover's Pinball Loss (EMPL) is a loss function for Deep Learning-based histogram regression, which incorporates cross-bin information and yields distributions over plausible histograms, expressed in terms of quantiles of the cumulative histogram in each bin. The EMPL compares two (normalised) histograms and as

where and are the cumulative histograms. Here, is the quantile level of interest. For the particular case of the median (), the EMPL reduces to the Earth Mover's Distance (or 1-Wasserstein distance) between two 1D histograms (e.g., Ramdas, Trillos & Cuturi 2017). Therefore, the EMPL is an asymmetric generalisation of the Earth Mover's Distance that enables the regression of arbitrary quantiles of the cumulative histogram in each bin (conditional on some input) by harnessing the idea of the pinball loss (e.g., Koenker & Bassett 1978).

Author: Florian List (Sydney Institute for Astronomy, School of Physics, A28, The University of Sydney, NSW 2006, Australia).

For any queries, please contact me at florian dot list at sydney dot edu dot au.

Overview

Toy example (histograms generated by drawing numbered balls from an urn)

Bimodal example (distribution of cumulative histograms in each bin is bimodal)

Bundesliga example (histograms of the league table position after every week)

NOTE: The file Bundesliga_Results.csv needs to be downloaded from Kaggle (contains Bundesliga results from 1993/94 to 2017/18).

Astrophysical example (estimating brightness histograms from γ-ray photon-count maps) The astrophysical example can be found in [this](https://github.com/FloList/GCE_NN) repository.

# Citation If you find this code or the paper useful, please consider citing > @inproceedings{List2021,
>                            author = {List, Florian},
>                            booktitle = {Proceedings of the 38th International Conference on Machine Learning},
>                            title = {{The Earth Mover’s Pinball Loss: Quantiles for Histogram-Valued Regression}},
>                            url = {https://arxiv.org/pdf/2106.02051.pdf},
>                            year = {2021},
>                            archiveprefix = {arXiv},
>                            arxivid = {2106.02051}
>                            }