ddpg-pytorch Search Results

225 results
for ddpg-pytorch

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

eleurent/rl-agents #52

rl-agents compatible with continuous action spaces

I am wondering is [cross entropy method](https://github.com/eleurent/rl-agents/tree/master/rl_agents/agents/cross_entropy_method) the only one that is compatible with continuous action spaces? I tr…

SHITIANYU-hue updated 4 years ago
3
pytorch/rl #1344

[Feature Request] Q Ensembles

## Motivation Twin Q/ensemble Q functions are used in many RL algorithms and mitigate Q overestimation. My understanding is that TorchRL only deals with ensembles in the loss functions. This is fine …

smorad updated 1 year ago
5
openai/baselines #938

DDPG implementation fails to learn well on at least five MuJ…

Dear @pzhokhov @matthiasplappert @christopherhesse et al., Thank you for providing an implementation of DDPG. However, I have been unable to get it to learn well on the standard MuJoCo environmen…

DanielTakeshi updated 2 years ago
21
openai/spinningup #33

The random seed doesn't work

Even if I set the same random seed, the result is different, and you can test it on ddpg. I think `tf.set_random_seed(seed)` doesn't work, but I don't know how to solve it.

xffxff updated 4 years ago
9
samkoesnadi/DDPG-tf2 #7

Sweep: refactor the codebase to be shorter and more concise…

Checklist - [X] Extract `src/model.py` ✓ https://github.com/samuelkoes/DDPG-tf2/commit/4a93601e75c0faf96857e438112421c46cba8521 - [X] Create `tests/test_model.py` ✓ https://github.com/samuelkoes/…

samkoesnadi updated 1 year ago
2
cselab/smarties #2

questions

hello，is there implement code with python for ’Remember and Forget for Experience Replay Supplementary Material‘, I had trouble with the gradient calculation.Is it right for me to compute the gradient…

codingliuyg updated 4 years ago
11
Armandpl/furuta #21

v2.0

This is a tracking issue for the second iteration of this project. **CAD/Mechanical Assembly:** - [x] #29 - [x] #22 - [x] #30 - [x] #31 - [ ] finish designing the part to mount the slip ring t…

Armandpl updated 10 months ago
2
assume-framework/assume #398

Early Stopping, Learning rate and noise decay

Implement the best practices from multi-agent Rl community and stablebaselines3 into our algorithm. Further analyse similarities between petting zoo multi-agent implementation to current RL implementa…

kim-mskw updated 2 months ago
3
AI4Finance-Foundation/FinRL #384

FinRL_Ensemble_StockTrading error for NIFTY_50

I tried to solve the error for the NaN value according to this [reference](https://github.com/AI4Finance-Foundation/FinRL/issues/353#issuecomment-975188649) but after the preprocessing is done correct…

Soumadip-Saha updated 1 year ago
4
pytorch/rl #90

[Feature Request] Please provide the genetic and low-level f…

hi, it's really great that facebookresearch is considering provide a library for reinforcement learning research. it would be very helpful if the library provide the low-level functionality rather …

walkacross updated 2 years ago
9

上一页 1...1 2 3 4 5 6 7...23 下一页

225 results for ddpg-pytorch

225 results
for ddpg-pytorch