-
Soft Actor-Critic (SAC) [1] is currently one of the most efficient model-free RL algorithms available. Its sample complexity is close to that of the best model-based reinforcement learning methods while still…
-
Hello,
Nice project =)
Quick question: did you try other algorithms that are usually better suited for continuous actions, like Soft Actor-Critic (SAC), DDPG, and TD3 (coming in the next release)…
-
The implementation is currently not correct. We need to figure out why and fix it.
Run with config/debug.yaml to test it. This configuration uses a very simple environment in which the goal is to m…
-
## 🚀 Feature
Could you please add a Tanh transform to the torch.distributions.transforms module?
## Motivation
The policy network used by the Soft Actor-Critic algorithm passes its output thr…
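For reference, here is a minimal sketch of what such a transform could look like, built on `torch.distributions.transforms.Transform` (the class name, the clamping constant, and the stable log-det form are illustrative choices, not a final API):

```python
import math

import torch
from torch.distributions import Normal, TransformedDistribution, constraints
from torch.distributions.transforms import Transform


class TanhTransform(Transform):
    """Sketch of a tanh bijector y = tanh(x) for use with TransformedDistribution."""
    domain = constraints.real
    codomain = constraints.interval(-1.0, 1.0)
    bijective = True
    sign = +1

    def _call(self, x):
        return torch.tanh(x)

    def _inverse(self, y):
        # atanh(y), clamped away from +/-1 for numerical stability
        y = y.clamp(-0.999999, 0.999999)
        return 0.5 * (torch.log1p(y) - torch.log1p(-y))

    def log_abs_det_jacobian(self, x, y):
        # log|d tanh(x)/dx| = log(1 - tanh(x)^2), written in a numerically stable form
        return 2.0 * (math.log(2.0) - x - torch.nn.functional.softplus(-2.0 * x))


# Usage: a tanh-squashed Gaussian, as used for SAC policies
base = Normal(torch.zeros(3), torch.ones(3))
squashed = TransformedDistribution(base, [TanhTransform()])
action = squashed.rsample()
log_prob = squashed.log_prob(action).sum(-1)  # sum over action dimensions
```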
-
Dear author,
In your implementation of Soft Actor-Critic, why is there no value function V(s)?
In the original paper of SAC, the authors said such a value function can stabilize training and is c…
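For what it's worth, many implementations follow the follow-up SAC paper ("Soft Actor-Critic Algorithms and Applications"), which drops the separate V(s) network and instead forms the critic target from the twin target Q networks and the entropy term. A rough, self-contained sketch of that target computation (the network and variable names here are illustrative, not this repo's):

```python
import torch
import torch.nn as nn
from torch.distributions import Normal

obs_dim, act_dim, batch = 4, 2, 8
qf1_target = nn.Linear(obs_dim + act_dim, 1)   # stand-ins for the target critics
qf2_target = nn.Linear(obs_dim + act_dim, 1)
alpha, gamma = 0.2, 0.99

def sample_action(obs):
    # Placeholder for a squashed-Gaussian policy: returns an action and its log-probability.
    dist = Normal(torch.zeros(obs.shape[0], act_dim), torch.ones(obs.shape[0], act_dim))
    x = dist.rsample()
    a = torch.tanh(x)
    # tanh change-of-variables correction on the log-probability
    log_prob = dist.log_prob(x).sum(-1, keepdim=True)
    log_prob -= torch.log(1.0 - a.pow(2) + 1e-6).sum(-1, keepdim=True)
    return a, log_prob

next_obs = torch.randn(batch, obs_dim)
reward = torch.randn(batch, 1)
done = torch.zeros(batch, 1)

with torch.no_grad():
    next_action, next_log_prob = sample_action(next_obs)
    q_in = torch.cat([next_obs, next_action], dim=-1)
    min_q_next = torch.min(qf1_target(q_in), qf2_target(q_in))
    # Soft value of the next state: V(s') = E[ min_i Q_i(s', a') - alpha * log pi(a'|s') ]
    next_value = min_q_next - alpha * next_log_prob
    q_target = reward + (1.0 - done) * gamma * next_value
```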
-
Respected sir,
I want to know how the parameters of the qf1_pi and qf2_pi models in sac.py are updated; qf1_pi and qf2_pi are used to find min_qf_pi in SAC,
but I could not find any loss function for th…
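If this repo follows the usual SAC structure, qf1_pi and qf2_pi are not separate models: they are the same critic networks qf1 and qf2 evaluated at actions sampled from the current policy. Their parameters are updated only by the critic loss, while min_qf_pi feeds the policy loss, where only the policy optimizer steps. A rough sketch of that pattern, with illustrative stand-in names rather than this repo's actual code:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

obs_dim, act_dim, batch = 4, 2, 8
qf1 = nn.Linear(obs_dim + act_dim, 1)   # critic 1 (illustrative stand-in)
qf2 = nn.Linear(obs_dim + act_dim, 1)   # critic 2
policy = nn.Linear(obs_dim, act_dim)    # stand-in for the squashed-Gaussian policy
critic_optim = torch.optim.Adam(list(qf1.parameters()) + list(qf2.parameters()), lr=3e-4)
policy_optim = torch.optim.Adam(policy.parameters(), lr=3e-4)
alpha = 0.2

obs = torch.randn(batch, obs_dim)
act = torch.randn(batch, act_dim)
q_target = torch.randn(batch, 1)        # stands in for r + gamma * (min Q_target - alpha * log pi)

# 1) Critic update: the only place where the parameters of qf1/qf2 are changed.
qf1_loss = F.mse_loss(qf1(torch.cat([obs, act], -1)), q_target)
qf2_loss = F.mse_loss(qf2(torch.cat([obs, act], -1)), q_target)
critic_optim.zero_grad()
(qf1_loss + qf2_loss).backward()
critic_optim.step()

# 2) Policy update: qf1_pi/qf2_pi are the *same* critics evaluated at policy actions.
pi_action = torch.tanh(policy(obs))
log_pi = torch.zeros(batch, 1)          # placeholder for log pi(a|s)
qf1_pi = qf1(torch.cat([obs, pi_action], -1))
qf2_pi = qf2(torch.cat([obs, pi_action], -1))
min_qf_pi = torch.min(qf1_pi, qf2_pi)
policy_loss = (alpha * log_pi - min_qf_pi).mean()
policy_optim.zero_grad()
policy_loss.backward()                  # gradients also flow into qf1/qf2 here...
policy_optim.step()                     # ...but only the policy optimizer steps, so the critics stay put
```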
-
Cool work.
It seems that you have implemented SAC to support discrete action spaces.
I wonder whether this project contains a tiny demo of running Soft Actor-Critic on a discrete action s…
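In case it helps, here is a rough sketch (not taken from this project) of the core discrete-action SAC idea: the policy outputs a categorical distribution, so expectations over actions can be computed exactly rather than sampled, and the critic outputs one Q-value per action. All names below are illustrative:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

obs_dim, n_actions, batch = 4, 3, 8
policy = nn.Linear(obs_dim, n_actions)   # logits over discrete actions (stand-in)
qf = nn.Linear(obs_dim, n_actions)       # Q(s, .) -> one value per action (stand-in)
alpha = 0.2

obs = torch.randn(batch, obs_dim)
logits = policy(obs)
probs = F.softmax(logits, dim=-1)
log_probs = F.log_softmax(logits, dim=-1)
q_values = qf(obs)

# Discrete SAC policy loss: exact expectation over actions instead of a sampled one.
policy_loss = (probs * (alpha * log_probs - q_values)).sum(dim=-1).mean()

# Soft state value used in the critic target, again as an exact expectation:
# V(s) = sum_a pi(a|s) * (Q(s, a) - alpha * log pi(a|s))
value = (probs * (q_values - alpha * log_probs)).sum(dim=-1)
```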
-
Hi, are there any pointers on how to reproduce the Discrete SAC code in TF2? In particular, `torch.gather()` does not behave quite the same way as `tf.gather` or `tf.gather_nd`. Any help w…
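For the specific pattern used in discrete SAC (picking Q(s, a) out of a `[batch, num_actions]` tensor), `torch.gather(..., dim=1, ...)` can usually be reproduced with `tf.gather(..., axis=1, batch_dims=1)` or with `tf.gather_nd` plus explicit batch indices. A small sketch comparing the three (tensor names are illustrative):

```python
import numpy as np
import tensorflow as tf
import torch

q_np = np.arange(12, dtype=np.float32).reshape(4, 3)     # Q-values, shape [batch=4, num_actions=3]
a_np = np.array([[2], [0], [1], [2]], dtype=np.int64)    # chosen action per row, shape [4, 1]

# PyTorch: out[i, 0] = q[i, a[i, 0]]
q_torch = torch.gather(torch.from_numpy(q_np), dim=1, index=torch.from_numpy(a_np))

q_t = tf.constant(q_np)
a_t = tf.constant(a_np)

# TF2 option 1: tf.gather with batch_dims=1 does the same per-row lookup
q_tf = tf.gather(q_t, a_t, axis=1, batch_dims=1)

# TF2 option 2: tf.gather_nd with explicit (row, action) index pairs
rows = tf.cast(tf.range(tf.shape(q_t)[0]), tf.int64)[:, None]
q_tf_nd = tf.gather_nd(q_t, tf.concat([rows, a_t], axis=1))[:, None]

assert np.allclose(q_torch.numpy(), q_tf.numpy())
assert np.allclose(q_torch.numpy(), q_tf_nd.numpy())
```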
-
## Problem with Signal
Signal has ***copious*** privacy issues that make it unfit for endorsement by privacytools.io.
1. Users are forced to supply a phone number to Signal (https://github.com/privacy…