-
I am wondering whether the [cross entropy method](https://github.com/eleurent/rl-agents/tree/master/rl_agents/agents/cross_entropy_method) is the only one that is compatible with continuous action spaces.
I tr…
-
## Motivation
Twin Q/ensemble Q functions are used in many RL algorithms and mitigate Q overestimation. My understanding is that TorchRL only deals with ensembles in the loss functions. This is fine …
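
For context, here is a minimal sketch of how taking the minimum over an ensemble of Q estimates counteracts overestimation when building a TD target. It is plain Python and does not use the TorchRL API; the function name `clipped_ensemble_target` and its parameters are hypothetical, chosen only for illustration.

```python
from typing import Sequence


def clipped_ensemble_target(
    reward: float,
    done: bool,
    next_q_values: Sequence[float],
    gamma: float = 0.99,
) -> float:
    """Hypothetical helper: TD target using the minimum over ensemble members.

    next_q_values holds one next-state Q estimate per ensemble member
    (two entries for a twin-Q setup).
    """
    if not next_q_values:
        raise ValueError("need at least one Q estimate")
    # Taking the smallest estimate biases the target downwards, which
    # offsets the upward bias of any single overestimating critic.
    pessimistic_q = min(next_q_values)
    # No bootstrapping from the next state once the episode has ended.
    bootstrap = 0.0 if done else pessimistic_q
    return reward + gamma * bootstrap
```

With two critics this reduces to the usual twin-Q target; with more members the same reduction applies unchanged.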
-
Dear @pzhokhov @matthiasplappert @christopherhesse et al.,
Thank you for providing an implementation of DDPG. However, I have been unable to get it to learn well on the standard MuJoCo environmen…
-
Even if I set the same random seed, the results differ between runs; you can reproduce this with ddpg. I think `tf.set_random_seed(seed)` doesn't work, but I don't know how to solve it.
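
One thing that may be worth checking: `tf.set_random_seed` only sets the graph-level seed for ops created after the call, and it does not touch Python's `random` module or NumPy. Below is a minimal sketch that seeds all of them in one place. The helper name `seed_everything` is hypothetical, and the sketch assumes it is called before the graph is built. It does not seed the environment, and some GPU ops can still be nondeterministic, so this may not be sufficient on its own.

```python
import random


def seed_everything(seed: int) -> None:
    """Hypothetical helper: seed every RNG source that is available."""
    # Python's built-in RNG (used by some replay buffers and samplers).
    random.seed(seed)

    # NumPy's global RNG, if NumPy is installed.
    try:
        import numpy as np
        np.random.seed(seed)
    except ImportError:
        pass

    # TensorFlow, if installed. Call this before creating any ops.
    try:
        import tensorflow as tf
    except ImportError:
        return
    if hasattr(tf, "random") and hasattr(tf.random, "set_seed"):
        tf.random.set_seed(seed)  # TF 2.x
    else:
        tf.set_random_seed(seed)  # TF 1.x graph-level seed
```

Calling `seed_everything(seed)` once at the very start of the script, before any networks are constructed, is the assumed usage.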
-
Checklist
- [X] Extract `src/model.py` ✓ https://github.com/samuelkoes/DDPG-tf2/commit/4a93601e75c0faf96857e438112421c46cba8521
- [X] Create `tests/test_model.py` ✓ https://github.com/samuelkoes/…
-
Hello, is there a Python implementation of 'Remember and Forget for Experience Replay Supplementary Material'? I had trouble with the gradient calculation. Is it right for me to compute the gradient…
-
This is a tracking issue for the second iteration of this project.
**CAD/Mechanical Assembly:**
- [x] #29
- [x] #22
- [x] #30
- [x] #31
- [ ] finish designing the part to mount the slip ring t…
-
Implement the best practices from the multi-agent RL community and stablebaselines3 in our algorithm. Further analyse similarities between the petting zoo multi-agent implementation and the current RL implementa…
-
I tried to fix the NaN value error by following this [reference](https://github.com/AI4Finance-Foundation/FinRL/issues/353#issuecomment-975188649), but after the preprocessing is done correct…
-
Hi, it's really great that facebookresearch is considering providing a library for reinforcement learning research.
It would be very helpful if the library provided the low-level functionality rather …