I would like to implement the CrossQ algorithm as proposed in https://openreview.net/pdf?id=PczQtTsTIX.
I already did some testing and it looks like its not that much effort to implement it based on SAC.
I will implement it for myself anyway, so I might as well contribute to the repo.
Please let me know if this contribution is something you are interested in.
Problem Description
I would like to implement the CrossQ algorithm as proposed in https://openreview.net/pdf?id=PczQtTsTIX. I already did some testing and it looks like its not that much effort to implement it based on SAC. I will implement it for myself anyway, so I might as well contribute to the repo. Please let me know if this contribution is something you are interested in.
Checklist
poetry install
(see CleanRL's installation guideline.