Hi there 👋🏼
First of all, great job on your work! It has served as an inspiration to our research group. We are actively developing RL4CO, a library for all things Reinforcement Learning for Combinatorial Optimization. We reimplemented POMO in the library and modularized it to be compatible with problems other than TSP and CVRP. We also built on some recent software (such as TorchRL, TensorDict, PyTorch Lightning, and Hydra) as well as routines such as FlashAttention. We tried to make everything as easy to use as possible, in the hope of helping practitioners and researchers.
Feel free to check RL4CO out 🚀 ^ ^