-
Add the A2C algorithm, which is the synchronous version of the algorithm described in this paper: https://arxiv.org/pdf/1602.01783.pdf
and described here: https://medium.com/emergent-future/simple-rei…
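A minimal sketch of the synchronous advantage actor-critic update that paper describes; the network sizes, discrete-action setup, and coefficients below are illustrative assumptions, not the paper's exact configuration:
```python
# Minimal synchronous advantage actor-critic (A2C) update sketch.
# Network sizes and loss coefficients are illustrative assumptions.
import torch
import torch.nn as nn

class ActorCritic(nn.Module):
    def __init__(self, obs_dim, n_actions):
        super().__init__()
        self.body = nn.Sequential(nn.Linear(obs_dim, 64), nn.Tanh())
        self.pi = nn.Linear(64, n_actions)   # policy logits
        self.v = nn.Linear(64, 1)            # state-value head

    def forward(self, obs):
        h = self.body(obs)
        return self.pi(h), self.v(h).squeeze(-1)

def a2c_loss(model, obs, actions, returns, value_coef=0.5, entropy_coef=0.01):
    """One synchronous update over a batch gathered from parallel envs."""
    logits, values = model(obs)
    dist = torch.distributions.Categorical(logits=logits)
    advantages = returns - values.detach()            # advantage estimate
    policy_loss = -(dist.log_prob(actions) * advantages).mean()
    value_loss = (returns - values).pow(2).mean()
    entropy = dist.entropy().mean()                   # exploration bonus
    return policy_loss + value_coef * value_loss - entropy_coef * entropy
```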
-
Implement and explore the effectiveness of an actor-critic agent.
-
I am implementing a Soft Actor-Critic (SAC) agent and need to evaluate the Q-value network inside my custom environment (for the implementation of a special algorithm, called the Wolpertinger algorithm, to ha…
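A minimal sketch of querying a SAC-style critic over a batch of candidate actions, as the Wolpertinger approach does when re-ranking the nearest neighbours of a proto-action; the class and function names here are illustrative assumptions, not a specific library's API:
```python
# Sketch: scoring candidate actions with a SAC-style Q-network
# (Wolpertinger-style re-ranking). Names and shapes are assumptions.
import torch
import torch.nn as nn

class QNetwork(nn.Module):
    def __init__(self, obs_dim, act_dim, hidden=256):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(obs_dim + act_dim, hidden), nn.ReLU(),
            nn.Linear(hidden, 1),
        )

    def forward(self, obs, act):
        return self.net(torch.cat([obs, act], dim=-1)).squeeze(-1)

def best_of_candidates(q_net, obs, candidates):
    """obs: (obs_dim,); candidates: (k, act_dim) nearest-neighbour actions.
    Returns the candidate with the highest Q-value under the current critic."""
    with torch.no_grad():                    # evaluation only, no gradients
        obs_batch = obs.expand(candidates.shape[0], -1)
        q_values = q_net(obs_batch, candidates)
    return candidates[q_values.argmax()]
```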
-
I am unable to obtain the result reported in the paper ‘Soft Actor-Critic Algorithms and Applications’ on the OpenAI Gym environment Humanoid-v2. My result is 6000 while the original paper reports 8000, …
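One common source of such a gap is the temperature setting: that paper tunes the entropy coefficient automatically against a target entropy of −dim(A), which is −17 for Humanoid-v2. A minimal sketch of that tuning step, assuming a typical PyTorch SAC implementation (the variable names are illustrative):
```python
# Sketch of SAC automatic temperature (alpha) tuning from
# "Soft Actor-Critic Algorithms and Applications".
# Variable names are illustrative assumptions about a typical implementation.
import torch

action_dim = 17                              # Humanoid-v2 action dimension
target_entropy = -float(action_dim)          # the paper's heuristic: -|A|
log_alpha = torch.zeros(1, requires_grad=True)
alpha_opt = torch.optim.Adam([log_alpha], lr=3e-4)

def update_alpha(log_prob):
    """log_prob: log pi(a|s) of freshly sampled actions, shape (batch,)."""
    alpha_loss = -(log_alpha * (log_prob + target_entropy).detach()).mean()
    alpha_opt.zero_grad()
    alpha_loss.backward()
    alpha_opt.step()
    return log_alpha.exp().item()            # current temperature
```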
-
When I tried to run ./train_mpe_spread.sh, I ran into the following issue:
```
obs_space: [Box(18,), Box(18,), Box(18,)]
share_obs_space: [Box(54,), Box(54,), Box(54,)]
act_space: [Discrete(5), Disc…
```
-
Nice to have would be a continuous value-based RL algorithm (the best fit would probably be Neural Fitted Q Iteration) as well as a lifelong policy-gradient algorithm (e.g. Natural Actor-Critic). Maybe some dynamic pro…
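A minimal sketch of the Neural Fitted Q Iteration loop (batch fitted Q iteration with a neural regressor, per Riedmiller 2005); the network size, iteration counts, and data layout below are assumptions for illustration:
```python
# Sketch of Neural Fitted Q Iteration (NFQ): repeatedly regress a Q-network
# onto bootstrapped targets computed over a fixed batch of transitions.
# Network size, iteration counts, and data layout are illustrative assumptions.
import torch
import torch.nn as nn

def nfq(transitions, obs_dim, n_actions, iterations=20, gamma=0.99):
    """transitions: list of (obs, action, reward, next_obs, done) tensors,
    with actions stored as int64 and done flags as floats."""
    q_net = nn.Sequential(nn.Linear(obs_dim, 64), nn.Tanh(),
                          nn.Linear(64, n_actions))
    opt = torch.optim.Adam(q_net.parameters(), lr=1e-3)
    obs, act, rew, next_obs, done = map(torch.stack, zip(*transitions))
    for _ in range(iterations):
        # Freeze targets for this iteration (the "fitted" step).
        with torch.no_grad():
            target = rew + gamma * (1 - done) * q_net(next_obs).max(dim=-1).values
        for _ in range(100):   # supervised regression epochs on the fixed batch
            q = q_net(obs).gather(1, act.unsqueeze(1)).squeeze(1)
            loss = (q - target).pow(2).mean()
            opt.zero_grad(); loss.backward(); opt.step()
    return q_net
```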
-
Is RNN support available for the TD3 and SAC algorithms? On the Tianshou website there is a table that says RNNs are not supported for either TD3 or SAC; however, there are functions RecurrentCriti…
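Independent of what Tianshou itself supports, the idea behind a recurrent critic is to run the observation history through an RNN and condition the Q-value on the final hidden state plus the action. A minimal PyTorch sketch of that idea (not Tianshou's actual RecurrentCritic class, whose signature is not verified here):
```python
# Generic recurrent critic sketch: an LSTM summarises the observation
# history, and the Q-head conditions on that summary plus the action.
# This is an illustration, not Tianshou's actual RecurrentCritic class.
import torch
import torch.nn as nn

class RecurrentQNet(nn.Module):
    def __init__(self, obs_dim, act_dim, hidden=128):
        super().__init__()
        self.rnn = nn.LSTM(obs_dim, hidden, batch_first=True)
        self.head = nn.Sequential(
            nn.Linear(hidden + act_dim, hidden), nn.ReLU(),
            nn.Linear(hidden, 1),
        )

    def forward(self, obs_seq, action, state=None):
        """obs_seq: (batch, time, obs_dim); action: (batch, act_dim)."""
        out, state = self.rnn(obs_seq, state)
        summary = out[:, -1]               # hidden state after the last step
        q = self.head(torch.cat([summary, action], dim=-1)).squeeze(-1)
        return q, state                    # carry state across env steps
```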
-
Support a continuous action space for selecting real-valued hyperparameters within the bounds specified by the algorithm space config:
- https://medium.com/@asteinbach/actor-critic-using-deep-rl-continuous-mounta…
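A minimal sketch of what such a space could look like: a Gym Box bounded by the config's hyperparameter ranges, with sampled actions mapped back to named real values. The config keys and bounds below are assumptions for illustration:
```python
# Sketch: a continuous hyperparameter-selection space built from bounds in
# an algorithm space config. Keys and bounds are illustrative assumptions.
import numpy as np
from gym import spaces

hp_bounds = {                 # hypothetical config: name -> (low, high)
    "lr": (1e-5, 1e-2),
    "gamma": (0.9, 0.999),
    "entropy_coef": (0.0, 0.1),
}

low = np.array([lo for lo, _ in hp_bounds.values()], dtype=np.float32)
high = np.array([hi for _, hi in hp_bounds.values()], dtype=np.float32)
hp_space = spaces.Box(low=low, high=high, dtype=np.float32)

def decode(action):
    """Map a point sampled from the Box back to named hyperparameters."""
    clipped = np.clip(action, low, high)
    return dict(zip(hp_bounds.keys(), clipped.tolist()))

print(decode(hp_space.sample()))
```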
-
I let [sim2real.py](https://github.com/Zhehui-Huang/quad-swarm-rl/blob/master/swarm_rl/sim2real/sim2real.py) create the C code for network evaluation; however, I am a bit confused about the calculation…
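For reference, the evaluation such generated code performs usually reduces to repeated matrix-vector products with a nonlinearity between layers. A minimal sketch of that arithmetic, assuming a plain feed-forward network (the tanh activation and layer layout are assumptions, not necessarily what sim2real.py emits):
```python
# Sketch of a plain MLP forward pass, i.e. the arithmetic that generated
# network-evaluation code typically unrolls. The tanh activation and layer
# layout are illustrative assumptions about the exported network.
import numpy as np

def mlp_forward(x, weights, biases):
    """weights: list of (out, in) matrices; biases: list of (out,) vectors."""
    for W, b in zip(weights[:-1], biases[:-1]):
        x = np.tanh(W @ x + b)           # hidden layers: affine map + nonlinearity
    return weights[-1] @ x + biases[-1]  # linear output layer
```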
-
Thanks for the reply; I have been busy with another project for the last few days and only recently got some spare time.
I have noticed that in comm_net, the variables of the communication part (maybe along with the encoder part) a…