-
Hey!
First of all, thank you for this library!
I would like to take your actors and critics and implement an RNN-enhanced TD3 algorithm as described here: https://arxiv.org/pdf/1710.06537.pdf.
I …
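Concretely, I imagine replacing the feed-forward actor trunk with a recurrent one. A minimal sketch in PyTorch of what I have in mind (class and parameter names here are hypothetical, not from this library):

```python
import torch.nn as nn

class RecurrentActor(nn.Module):
    """Sketch of a TD3 actor with an LSTM trunk, so the policy can
    condition on observation histories instead of single frames."""

    def __init__(self, obs_dim, action_dim, max_action, hidden_size=128):
        super().__init__()
        self.lstm = nn.LSTM(obs_dim, hidden_size, batch_first=True)
        self.head = nn.Sequential(
            nn.Linear(hidden_size, 128),
            nn.ReLU(),
            nn.Linear(128, action_dim),
            nn.Tanh(),
        )
        self.max_action = max_action

    def forward(self, obs_seq, hidden=None):
        # obs_seq: (batch, seq_len, obs_dim); hidden is the recurrent state
        out, hidden = self.lstm(obs_seq, hidden)
        return self.max_action * self.head(out), hidden
```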
-
Hello,
Is there any benefit to having a vanilla REINFORCE algorithm for people trying to learn the concepts? REINFORCE with Baseline includes a value function approximator which has a lot of simila…
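For what it's worth, a minimal sketch of how close the two are (a hypothetical helper, not this repo's API): the only difference is whether a value estimate is subtracted from the return before weighting the log-probabilities.

```python
import torch

def reinforce_loss(log_probs, returns, values=None):
    # Vanilla REINFORCE: weight each log-probability by the raw return.
    # With a baseline: subtract a (detached) value estimate first.
    returns = torch.as_tensor(returns, dtype=torch.float32)
    advantages = returns if values is None else returns - values.detach()
    return -(torch.stack(log_probs) * advantages).sum()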
-
There are several optimizations to our PPO recipe that could help push it closer to SOTA performance. There are also several pieces of documentation we could offer alongside this recipe t…
-
I was trying to reproduce your results; however, whenever I try to run the `simple_speaker_listener` script, it crashes with a shape mismatch (currently trying your latest commit, but also on old co…
-
Here is the code from reinforce.py:
```python
for action, r in zip(self.saved_actions, rewards):
    action.reinforce(r)
```
And here is the code from actor-critic.py:
```python
for (action, value), r in zi…
```
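(Note that `Tensor.reinforce()` has since been removed from PyTorch; a sketch of the equivalent explicit update, assuming a hypothetical `saved_log_probs` list holding the `log_prob` tensors collected during the rollout:)

```python
import torch

def reinforce_update(saved_log_probs, rewards, optimizer):
    # Replaces the removed action.reinforce(r): build the -log_prob * r
    # terms explicitly and backpropagate through the stored log-probs.
    optimizer.zero_grad()
    loss = torch.stack([-lp * r for lp, r in zip(saved_log_probs, rewards)]).sum()
    loss.backward()
    optimizer.step()
```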
-
When I read the paper, they say that it works with discrete action spaces.
Is it also possible with continuous action spaces?
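If the only discrete-specific part is the policy head, one common adaptation (a sketch, not necessarily what the paper does) is to swap the categorical head for a Gaussian one:

```python
import torch
import torch.nn as nn
from torch.distributions import Categorical, Normal

class PolicyHead(nn.Module):
    # The same trunk features feed either a discrete or a continuous head;
    # log_prob() exists on both distributions, so the policy-gradient
    # update itself is unchanged.
    def __init__(self, feat_dim, action_dim, continuous=False):
        super().__init__()
        self.continuous = continuous
        self.out = nn.Linear(feat_dim, action_dim)
        self.log_std = nn.Parameter(torch.zeros(action_dim))

    def forward(self, features):
        if self.continuous:
            return Normal(self.out(features), self.log_std.exp())
        return Categorical(logits=self.out(features))
```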
-
Hi Daniel! Thanks for this excellent repo! I enjoyed reading this paper too!
Here is a small question about the baseline MASAC in your paper.
In the above equation, you do not provide detail …
-
I suppose the SAC algorithm has one actor network and two critic networks. Now I want to rank the importance of the DRL states by calculating integrated gradients of each state in order to sort the states, so I wonder if t…
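A minimal sketch of that computation, assuming a `critic(state, action)` callable that returns Q(s, a) and a zero baseline (all names here are hypothetical):

```python
import torch

def integrated_gradients(critic, state, action, steps=50):
    # Interpolate from a zero baseline to the state, accumulate dQ/ds
    # along the path, then scale by (state - baseline).
    baseline = torch.zeros_like(state)
    grads = torch.zeros_like(state)
    for alpha in torch.linspace(0.0, 1.0, steps):
        point = (baseline + alpha * (state - baseline)).requires_grad_(True)
        q = critic(point, action)
        grads += torch.autograd.grad(q.sum(), point)[0]
    return (state - baseline) * grads / steps
```

The per-dimension magnitudes of the returned attributions could then be used to sort the state features.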
-
Dear Feiyun,
I've been reading your paper,
[Cohesion-based Online Actor-Critic Reinforcement Learning for mHealth Intervention](https://arxiv.org/pdf/1703.10039.pdf),
with much interest. I wo…