-
I need that algorithm implemented here!!!
-
It occurred to me that this recent paper would be an interesting one to implement inside brax.
One of the cool things about brax is its differentiability, but as I understand it, attempts to leverage that …
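For concreteness, roughly what I have in mind is something like the sketch below: backpropagating through an entire rollout and ascending the return directly. Note that `dynamics` and `policy` here are made-up toy functions, not the actual brax API; this is only a sketch of the idea under that assumption.

```python
# Toy sketch: analytic policy gradients through a differentiable rollout.
# `dynamics` is a hypothetical differentiable step function standing in for a
# brax environment step; it is NOT brax's real API.
import jax
import jax.numpy as jnp


def dynamics(state, action):
    # Placeholder differentiable dynamics and reward.
    next_state = state + 0.1 * action
    reward = -jnp.sum(next_state ** 2)
    return next_state, reward


def policy(params, state):
    # Linear policy, purely for illustration.
    return jnp.tanh(params @ state)


def rollout_return(params, init_state, horizon=10):
    def step(carry, _):
        state, total = carry
        action = policy(params, state)
        state, reward = dynamics(state, action)
        return (state, total + reward), None

    (_, total), _ = jax.lax.scan(step, (init_state, 0.0), None, length=horizon)
    return total


params = jnp.zeros((2, 2))
init_state = jnp.array([1.0, -1.0])
# Every operation is differentiable, so we can take gradients of the return
# with respect to the policy parameters through the whole trajectory.
grads = jax.grad(rollout_return)(params, init_state)
params = params + 1e-2 * grads
```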
-
Hello, I am using the PPO method of your program to train the spacerobot, but I have run into a problem. I use the file (PPO/Continious/PPO/main.py) to train the spacerobot, and the xml file is spacerobotstate, but …
-
Thanks for the paper; it is really cool and useful.
On page 22 of the paper, it says:
> For reincarnating D4PG using QDagger, we minimize a distillation loss between the D4PG’s actor policy and the …
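My reading of that sentence (it is cut off above, so this is only a guess) is a term roughly like the sketch below: a mean-squared error between the frozen teacher actor's actions and the student's, added to the student's usual actor loss with a decaying weight. All names here are illustrative, not taken from your code.

```python
import jax
import jax.numpy as jnp


def distillation_loss(student_params, teacher_params, student_actor, teacher_actor, obs):
    # Teacher is frozen: stop gradients so only the student's parameters move.
    teacher_actions = jax.lax.stop_gradient(teacher_actor(teacher_params, obs))
    student_actions = student_actor(student_params, obs)
    # Assumed form: MSE between deterministic actor outputs on the same batch.
    return jnp.mean(jnp.sum((student_actions - teacher_actions) ** 2, axis=-1))


# The student's objective would then be something like
#   loss = d4pg_actor_loss + lam * distillation_loss(...)
# with lam annealed towards zero as the student takes over from the teacher.
```

Is that roughly the intended loss, or does the distillation target something else?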
-
The current implementation of `ActorCriticBase` makes it a bit tricky to have custom actor and critic networks that share layers. This is because the instantiation of the networks happens in the `…
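For reference, what I would like to be able to write is roughly the following (a minimal flax sketch, not the repo's actual `ActorCriticBase`): a single module that builds the shared trunk once, so both heads train the same parameters.

```python
import jax
import jax.numpy as jnp
import flax.linen as nn


class SharedActorCritic(nn.Module):
    action_dim: int

    @nn.compact
    def __call__(self, obs):
        # Shared trunk: receives gradients from both the actor and critic heads.
        h = nn.relu(nn.Dense(64)(obs))
        h = nn.relu(nn.Dense(64)(h))
        # Separate heads on top of the shared features.
        action_mean = nn.Dense(self.action_dim)(h)
        value = nn.Dense(1)(h)
        return action_mean, jnp.squeeze(value, axis=-1)


model = SharedActorCritic(action_dim=4)
params = model.init(jax.random.PRNGKey(0), jnp.zeros((1, 8)))
action_mean, value = model.apply(params, jnp.zeros((1, 8)))
```

With networks instantiated inside the base class instead, there is no obvious place to construct this shared trunk once and hand it to both heads.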
-
Hello,
In the [asynchronous dqn paper](http://arxiv.org/pdf/1602.01783v1.pdf), they also describe an on-policy method, the asynchronous advantage actor-critic (A3C), which achieved better results than the other methods; do …
-
I am confused by your code.
In the paper, it is mentioned that a policy gradient method [1] is used, but more specifically, I think it is implemented as an actor-critic.
If I am wrong, please tell m…
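To make my question concrete, here is the distinction I have in mind (a sketch with illustrative names, not your code): a plain policy-gradient (REINFORCE) update weights log-probabilities by the full Monte Carlo return, while an actor-critic weights them by an advantage built from a learned value function, which is what your code appears to do.

```python
import jax
import jax.numpy as jnp


def reinforce_loss(log_probs, returns):
    # REINFORCE: scale each log-prob by the sampled return G_t.
    return -jnp.mean(log_probs * returns)


def actor_critic_loss(log_probs, rewards, values, next_values, gamma=0.99):
    # Actor-critic: replace G_t with a bootstrapped advantage
    # A_t = r_t + gamma * V(s_{t+1}) - V(s_t), and also fit the critic.
    advantages = rewards + gamma * next_values - values
    policy_loss = -jnp.mean(log_probs * jax.lax.stop_gradient(advantages))
    value_loss = jnp.mean(advantages ** 2)
    return policy_loss + 0.5 * value_loss
```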
-
Go beyond what is taught in the unit and look at frontier research to solve the RL problem.
-
Several deep RL agents are missing, such as A2C and A3C, which could be added. Further work could also include adding MARL agents such as MAA2C or MADDPG.