actor-critic-algorithm Search Results

752 results
for actor-critic-algorithm


SciSharp/TensorFlow.NET #438

Integrate with ml-agents

If TF.NET could be connected to this, things should be a lot easier. Python often runs into incompatibility problems that are not easy to debug. Unity Machine Learning Agents Toolkit https://github.com/…

Deep-Blue-2013 updated 4 years ago
5
rlworkgroup/garage #1007

Question: Are Actor-Critic related algorithms being added any…

kishanpb updated 4 years ago
1
openai/gym #1758

Differences between Hopper-v1 and Hopper-v2

Hi, to my knowledge, Hopper-v1 is deprecated and Hopper-v2 is the standard Hopper as of today. Can someone confirm whether this is true? In most RL papers, I see results where the au…

HareshKarnan updated 4 years ago
3
hill-a/stable-baselines #551

How exactly are the actor-critic networks created?

Deep Deterministic Policy Gradients ([DDPG][1]) and the Stable Baselines code are presented [here][2]. The actor-critic networks are created as follows: normalized_obs = tf.clip_by_value(normali…

RyanRizzo96 updated 4 years ago
3
hill-a/stable-baselines #198

[feature request] Implement goal-parameterized algorithms (H…

I'd like to implement Hindsight Experience Replay (HER). This can be based on any goal-parameterized off-policy RL algorithm. **Goal-parameterized architectures**: it requires a variable for…

ccolas updated 4 years ago
22
theogruner/rl_pro_telu #4

Non-ASCII character '\xce' in file

Hello, thanks for making this repo. I tried to connect my env and run it, but I get the following error: **SyntaxError: Non-ASCII character '\xce' in file /home/at-lab/catkin_ws3/rl_pro_telu/mpo/mpo…

murtazabasu updated 4 years ago
3
oxwhirl/pymarl #18

Baselines used in COMA, such as Central-V, IAC-V.

I tried to implement the baselines used in your paper, such as Central-V and IAC-V, in this project on the 3M map, but I cannot reproduce the results reported in your paper. The following is the training cur…

yalidu updated 4 years ago
12
waylen94/Machine-Learning-Case-Study #1

20190916 Machine Learning study list

# Reinforcement Learning Study List -[] Brief of Reinforcement Learning -[] Methods -[] The reason to use -[] Preparation -[] Qlearning -[] Qlearning algorithm -[] Qlearning strategy -[…

waylen94 updated 5 years ago
2
hill-a/stable-baselines #329

[question] How to save a trained PPO2 agent to use in a Java…

I am using a PPO2 agent to train on a custom environment. I use the `save` function to store everything in a `.pkl` in the callback function, similar to the example from the Colab notebook. ```pyth…

josealeixopc updated 4 years ago
19
p-christ/Deep-Reinforcement-Learning-Algorithms-with-PyTorch #25

Additional `critic_target gather` in SAC_discrete.py

I found there's an additional gather operation in https://github.com/p-christ/Deep-Reinforcement-Learning-Algorithms-with-PyTorch/blob/master/agents/actor_critic_agents/SAC_Discrete.py#L74 It s…

YunqiuXu updated 5 years ago
1

