Pengfei Li, Jianyi Yang and Shaolei Ren
Note
This is the official implementation of the ICML 2023 paper
git clone git@github.com:Ren-Research/LOMAR.git
cd LOMAR
Then please refer to the install guide for more details about installation
To apply our algorithm (LOMAR) in online bipartite matching, you need three main steps
A script example for each step can be found in our brief tutorial.
In our experiment, we set $u_0 = 10$ and $v_0 = 60$ to generate the training and testing datasets. The number of graph instances in the training and testing datasets are 20000 and 1000, respectively. For the sake of reproducibility and fair comparision, our settings follows the same setup of our baseline.
Table 1: Comparison under different $\rho$. In the top, LOMAR ($\rho = x$) means LOMAR is trained with the value of $\rho = x$. The average reward and competitive ratio are represented by AVG and CR, respectively — the higher, the better. The highest value in each testing setup is highlighted in bold. The AVG and CR for DRL are 12.909 and 0.544 respectively. The average reward for OPT is 13.209 . |
The histogram of the bi-competitive ratios are visualized below. When $\rho = 0$, the ratio of DRL-OS / DRL is always 1 unsurprisingly. With a large $\rho$ (e.g. 0.8) for testing, the reward ratios of DRL-OS/Greedy for most samples are around 1, but the flexibility of DRL-OS is limited and can less exploit the good average performance of DRL.
Figure 1: Histogram of bi-competitive reward ratios of DRL-OS against Greedy and DRL under different $\rho$. The DRL-OS has the same online switching algorithm as LOMAR, while the RL model is trained with $\rho=0$. |
@inproceedings{Li2023LOMAR,
title={Learning for Edge-Weighted Online Bipartite Matching with Robustness Guarantees},
author={Li, Pengfei and Yang, Jianyi and Ren, Shaolei},
booktitle={International Conference on Machine Learning},
year={2023},
organization={PMLR}
}
Thanks for the code base from Mohammad Ali Alomrani, Reza Moravej, Elias B. Khalil. The public repository of their code is available at https://github.com/lyeskhalil/CORL