-
If tf.net could be connected to this, things would be a lot easier. Python often runs into incompatibility problems that are not easy to debug.
Unity Machine Learning Agents Toolkit
https://github.com/…
-
Hi,
To my knowledge, Hopper-v1 is deprecated and Hopper-v2 is the standard Hopper environment as of today. Can someone confirm whether this is true?
In most of the RL papers, I see results where the au…
-
The Stable Baselines code for Deep Deterministic Policy Gradient ([DDPG][1]) is presented [here][2].
The actor-critic networks are created as follows:
normalized_obs = tf.clip_by_value(normali…
-
I'd like to implement Hindsight Experience Replay (HER). It can be built on top of any goal-parameterized off-policy RL algorithm.
**Goal-parameterized architectures**: it requires a variable for…
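
For illustration only, the core relabeling step of HER might be sketched as below. This is a minimal sketch, not the implementation proposed in this issue: `Transition`, `sparse_reward`, and `her_relabel_final` are hypothetical names, the reward is assumed to be sparse (0 on reaching the goal, -1 otherwise), and only the "final" goal-selection strategy is shown.

```python
from typing import Callable, List, NamedTuple, Sequence, Tuple

Goal = Tuple[float, ...]


class Transition(NamedTuple):
    """One goal-conditioned transition (hypothetical container)."""
    obs: Tuple[float, ...]
    action: int
    next_obs: Tuple[float, ...]
    achieved_goal: Goal  # what was actually reached after this step
    goal: Goal           # goal the policy was conditioned on
    reward: float


def sparse_reward(achieved: Goal, goal: Goal, tol: float = 1e-6) -> float:
    """Assumed sparse reward: 0.0 if the goal is reached, else -1.0."""
    reached = all(abs(a - g) <= tol for a, g in zip(achieved, goal))
    return 0.0 if reached else -1.0


def her_relabel_final(
    episode: Sequence[Transition],
    reward_fn: Callable[[Goal, Goal], float] = sparse_reward,
) -> List[Transition]:
    """Relabel every transition with the goal achieved at the episode's end.

    The relabeled copies are meant to be stored in the replay buffer
    alongside the original transitions.
    """
    if not episode:
        return []
    new_goal = episode[-1].achieved_goal
    return [
        t._replace(goal=new_goal, reward=reward_fn(t.achieved_goal, new_goal))
        for t in episode
    ]


# Toy 1-D episode: the agent aimed for 5.0 but only reached 3.0.
episode = [
    Transition((0.0,), 1, (1.0,), (1.0,), (5.0,), -1.0),
    Transition((1.0,), 1, (2.0,), (2.0,), (5.0,), -1.0),
    Transition((2.0,), 1, (3.0,), (3.0,), (5.0,), -1.0),
]
relabeled = her_relabel_final(episode)
```

In the toy episode every original reward is -1.0, while the last relabeled transition receives 0.0 because its achieved goal matches the substituted goal. Any off-policy learner can consume both sets of transitions, provided its networks take the goal as an extra input.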
-
Hello, thanks for making this repo. I tried to connect my env and run it, but I get the following error:
**SyntaxError: Non-ASCII character '\xce' in file /home/at-lab/catkin_ws3/rl_pro_telu/mpo/mpo…
-
I tried to implement the baselines used in your paper, such as Central-V and IAC-V, in this project on the 3M map, but I cannot reproduce the results reported in your paper. The following is the training cur…
-
# Reinforcement Learning
Study List
- [ ] Brief of Reinforcement Learning
- [ ] Methods
- [ ] The reason to use
- [ ] Preparation
- [ ] Qlearning
- [ ] Qlearning algorithm
- [ ] Qlearning strategy
-[…
-
I am using a PPO2 agent to train on a custom environment. I call the `save` function from the callback function to store everything in a `.pkl` file, similar to the example in the Colab notebook.
```pyth…
-
I found there's an additional gather operation in
https://github.com/p-christ/Deep-Reinforcement-Learning-Algorithms-with-PyTorch/blob/master/agents/actor_critic_agents/SAC_Discrete.py#L74
It s…