-
Hi, first of all, great work. This is a very useful library for research on RL and NLP. It would be very helpful to add off-policy RL methods such as Q-learning, SAC, etc., along with benc…
-
Hi @praveen-palanisamy
I have been using macad-gym successfully over the past few months with PPO and many other algorithms. Now I am trying to use DDPG with RLlib, which requires continuous…
-
I want to build an RL algorithm that learns the concept of beating a benchmark (say, the S&P 500) at the tic level. If a tic is consistently beating the benchmark, the algorithm should prefer to pick that ti…
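
One possible way to express the benchmark-beating idea above is a per-step reward equal to the tic's return minus the benchmark's return over the same step. The following is a minimal sketch under that assumption; the function name `excess_return_rewards` and the sample prices are hypothetical and are not taken from any library mentioned in this thread.

```python
def excess_return_rewards(tic_prices, benchmark_prices):
    """Per-step reward: tic return minus benchmark return (hypothetical helper)."""
    if len(tic_prices) != len(benchmark_prices):
        raise ValueError("price series must have the same length")

    rewards = []
    for t in range(1, len(tic_prices)):
        # Simple one-step returns for the tic and for the benchmark.
        tic_ret = tic_prices[t] / tic_prices[t - 1] - 1.0
        bench_ret = benchmark_prices[t] / benchmark_prices[t - 1] - 1.0
        # Positive when the tic beat the benchmark on this step.
        rewards.append(tic_ret - bench_ret)
    return rewards


# Made-up prices: the tic gains 2% then 1%, the benchmark gains 1% then 0%.
rewards = excess_return_rewards([100.0, 102.0, 103.02], [4000.0, 4040.0, 4040.0])
```

With a reward shaped this way, an agent is paid only for outperformance, so a tic that merely tracks the benchmark earns roughly zero. Whether this matches the intended setup depends on details not given here, such as transaction costs and the step size.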
-
There are several state-of-the-art algorithms that use search to improve a policy trained with RL (e.g., AlphaZero, Student of Games). The current implementation of ML-Agents does not seem to support …
-
- Value-based RL
  - [ ] DQN
  - [ ] Rainbow DQN
  - [ ] [CQL](https://sites.google.com/view/cql-offline-rl)
- Value-based + policy-based RL
  - [x] DDPG
  - [ ] [TD3](https://spinni…
-
- [X] I have marked all applicable categories:
    + [ ] exception-raising bug
    + [ ] RL algorithm bug
    + [ ] system worker bug
    + [ ] system utils bug
    + [X] code design/refactor
…
-
**Is your feature request related to a problem? Please describe.**
Develop an RL agent to exploit arbitrage opportunities in the foreign exchange market by trading currency pairs.
**Describe the s…
-
Hey!
First of all, thank you for this library!
I would like to take your actors and critics and implement the RNN-enhanced TD3 algorithm described here: https://arxiv.org/pdf/1710.06537.pdf.
I …
-
I am training a robot with reinforcement learning using rsl_rl and Isaac Lab. While it works fine with simple settings, when I switch to more complex settings (such as Domain Randomization), the foll…
-
### What happened + What you expected to happen
### The bug
Default instantiation of `RLlib` algorithms causes deprecation warnings (even when the new API stack is selected). Furthermore, I cannot d…