-
In this example https://github.com/keras-team/keras-io/blob/master/examples/rl/actor_critic_cartpole.py, the gradient for the actor is defined as the gradient of the loss $L = \sum_t \ln \pi(a_t \mid s_t)\,(R_t - V(s_t))$, i.e. the log-probability of each action weighted by the return minus the critic's value estimate.…
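For reference, a minimal sketch of that loss in TensorFlow (variable names such as `action_log_probs`, `values`, and `returns` are illustrative, not the script's exact ones):

```python
import tensorflow as tf

# Minimal sketch of the loss from the linked example:
# L = sum_t log pi(a_t|s_t) * (R_t - V(s_t)).
def actor_critic_losses(action_log_probs, values, returns):
    advantages = returns - values  # R_t - V(s_t)
    # Gradient ascent on L is implemented as descent on its negative;
    # stop_gradient keeps the actor term from backpropagating into the critic.
    actor_loss = -tf.reduce_sum(action_log_probs * tf.stop_gradient(advantages))
    critic_loss = tf.reduce_sum(tf.square(advantages))  # simple value regression
    return actor_loss, critic_loss
```

Both losses can then be differentiated together with `tf.GradientTape`, which is how the linked example structures its update.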
-
Hi,
I came across your paper and have a few questions. My goal is to use your results and analysis to train a discrete SAC agent on parallel MiniGrid environments.
In `train_pql.py`, you have variables like…
-
Hello guys, I wonder if there is a way to train actor-critic algorithms in an off-policy manner, as in the paper [Sample Efficient Actor-Critic with Experience Replay](https://arxiv.org/abs/1611.0…
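For context, the core trick in that paper is to reweight replayed transitions by a truncated importance ratio between the current policy and the behaviour policy that generated them. A minimal sketch of such an update (illustrative names; not the paper's full algorithm, which adds Retrace targets and a trust-region step):

```python
import tensorflow as tf

# Sketch of an off-policy actor-critic update with truncated importance
# sampling: rho = pi(a|s) / mu(a|s), where mu is the behaviour policy
# that produced the replayed transition.
def off_policy_actor_loss(log_pi, log_mu, advantages, clip=10.0):
    rho = tf.exp(log_pi - log_mu)   # importance ratio
    rho = tf.minimum(rho, clip)     # truncation bounds the variance
    return -tf.reduce_mean(rho * log_pi * tf.stop_gradient(advantages))
```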
-
## Overview
Implement Soft Actor-Critic (SAC).
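For reference, SAC's actor maximizes the entropy-regularized objective $\mathbb{E}[Q(s,a) - \alpha \log \pi(a \mid s)]$. A minimal sketch of that loss (illustrative names, assuming a learned Q-function and a stochastic policy):

```python
import tensorflow as tf

# Sketch of the SAC actor loss: maximize E[Q(s, a) - alpha * log pi(a|s)],
# implemented as minimizing its negative. alpha is the entropy temperature;
# a larger alpha rewards more stochastic (exploratory) policies.
def sac_actor_loss(log_pi, q_values, alpha=0.2):
    return tf.reduce_mean(alpha * log_pi - q_values)
```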
-
I have the same problem as described in the first post.
-
## Abstract
- Presents training a neural network to generate sequences using an actor-critic method from RL
- Introduces a **critic** network trained to predict the value of an output token, given the policy of …
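A simplified sketch of that token-level idea, in which the actor's next-token distribution is pushed toward tokens the critic values highly (illustrative names, not the paper's exact formulation):

```python
import tensorflow as tf

# Simplified token-level actor update: weight the policy's probability for
# each candidate next token by the critic's predicted token value.
# probs, token_values: [batch, vocab_size].
def sequence_actor_loss(probs, token_values):
    # Expected value under the policy: sum_w pi(w|prefix) * Q(w).
    expected_value = tf.reduce_sum(probs * tf.stop_gradient(token_values), axis=-1)
    return -tf.reduce_mean(expected_value)  # ascend on expected token value
```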
-
Hello,
Correct me if I'm wrong, but I'm under the impression that the critic and the actor share the same hidden layers in the tutorial notebook. Why that constraint? (See the sketch below for the two layouts.)
Thanks
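For concreteness, a sketch contrasting a shared trunk with separate networks, assuming a CartPole-like setup (4-dimensional observations, two actions); this is illustrative, not the notebook's exact code:

```python
from tensorflow import keras
from tensorflow.keras import layers

# Shared trunk: one hidden stack feeds both heads, so actor and critic
# learn from common features (and share gradients through the trunk).
inputs = keras.Input(shape=(4,))                            # observation
common = layers.Dense(128, activation="relu")(inputs)
actor_out = layers.Dense(2, activation="softmax")(common)   # action probs
critic_out = layers.Dense(1)(common)                        # state value
shared_model = keras.Model(inputs, [actor_out, critic_out])

# Separate networks: nothing in the method requires sharing; this variant
# costs more parameters but decouples the two objectives.
a_in = keras.Input(shape=(4,))
actor = keras.Model(a_in, layers.Dense(2, activation="softmax")(
    layers.Dense(128, activation="relu")(a_in)))
c_in = keras.Input(shape=(4,))
critic = keras.Model(c_in, layers.Dense(1)(
    layers.Dense(128, activation="relu")(c_in)))
```

Sharing the trunk is a common efficiency choice rather than a requirement of the method; when the two objectives produce conflicting gradients, separate networks sometimes train more stably.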
-
Hello, I'd like to understand how to use the `Actor_MIP` class in the provided code. This part is mentioned as a highlight in your paper, but it seems that the class is not called or utilized in the c…