Just read your paper on Arxiv and saw your code implementation, impressed by your results. Great job! :)
Dropping by since we are actively developing RL4CO, a library for all things Reinforcement Learning for Combinatorial Optimization. The idea is to make everything as easy to use and modular as possible in the hope of helping practitioners and researchers, with the help of some recent software (such as TorchRL, TensorDict, PyTorch Lightning, FlashAttention, and Hydra - also saw that you are using it!).
We think your work could be a great addition to RL4CO and potentially gain visibility - if you are interested in contributing, feel free to leave PRs/issues or contact us anytime 😉
PS: your other work, JAMPR would also be great, we are looking to exploring time-windows as well soon~
Hi there 👋🏼
Just read your paper on Arxiv and saw your code implementation, impressed by your results. Great job! :)
Dropping by since we are actively developing RL4CO, a library for all things Reinforcement Learning for Combinatorial Optimization. The idea is to make everything as easy to use and modular as possible in the hope of helping practitioners and researchers, with the help of some recent software (such as TorchRL, TensorDict, PyTorch Lightning, FlashAttention, and Hydra - also saw that you are using it!).
We think your work could be a great addition to RL4CO and potentially gain visibility - if you are interested in contributing, feel free to leave PRs/issues or contact us anytime 😉
PS: your other work, JAMPR would also be great, we are looking to exploring time-windows as well soon~