-
Our current baseline RL algorithm is DQN (more accurately, DDQN). This algorithm uses epsilon-greedy policies so that it at least has a chance of fully exploring the environment in question. Using epsi…
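For reference, a minimal sketch of the epsilon-greedy action selection described above (plain NumPy; the function name and arguments are illustrative placeholders, not identifiers from the repository):

```python
import numpy as np

def epsilon_greedy_action(q_values, epsilon):
    """Pick a uniformly random action with probability epsilon, else the greedy one.

    q_values: 1-D array of Q estimates, one per discrete action.
    epsilon:  exploration rate in [0, 1].
    """
    if np.random.rand() < epsilon:
        return np.random.randint(len(q_values))  # explore: uniform random action
    return int(np.argmax(q_values))              # exploit: highest-value action
```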
-
Just wondering if there will be an upcoming SAC-Discrete implementation?
Thanks,
Christian
-
Dear Petros,
Thank you very much for the implementations; they are very useful. I was able to successfully run the code in the file Cart_Pole.py. I am now trying to run the Space_Invaders.p…
-
[Soft Actor-Critic for Discrete Action Settings](https://arxiv.org/abs/1910.07207v1)
-
Hello @AliiRezaei,
Nice work. Thank you so much for sharing it. I am really interested, particularly because of your work in C++. I am just wondering: if we change the algorithm from discrete actions to…
-
Hello, I need to make SacAgent work with discrete actions, so I am trying to implement the Gumbel-Softmax reparameterization trick by re-defining the relevant classes. However, the calculation of `agent.train(experie…
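In case it helps, here is a minimal sketch of the Gumbel-Softmax relaxation in TensorFlow (the function name and arguments are illustrative only, not part of the TF-Agents API):

```python
import tensorflow as tf

def gumbel_softmax_sample(logits, temperature=1.0):
    """Draw a differentiable, relaxed one-hot sample from a categorical distribution.

    logits: [batch, num_actions] unnormalised log-probabilities from the policy.
    """
    uniform = tf.random.uniform(tf.shape(logits), minval=1e-8, maxval=1.0)
    gumbel = -tf.math.log(-tf.math.log(uniform))           # Gumbel(0, 1) noise
    return tf.nn.softmax((logits + gumbel) / temperature)  # relaxed one-hot sample
```

Lower temperatures push the sample closer to a hard one-hot vector at the cost of noisier gradients.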
-
I applied the discrete SAC code to a custom discrete-action environment. During training, I found that the critic loss did not decrease but increased, and the critic-loss value after…
-
In the docs, it is mentioned that an alternate version of SAC with a slight change can be used for discrete action spaces. Could you please elaborate with some more details?
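For what it's worth, in the SAC-Discrete paper linked above the change is that the policy outputs a categorical distribution over actions and expectations over actions are computed exactly rather than via the reparameterization trick. A minimal sketch of the corresponding policy loss (TensorFlow; all names here are illustrative, not from any particular library):

```python
import tensorflow as tf

def discrete_sac_policy_loss(q_values, log_probs, alpha):
    """Policy loss for SAC with discrete actions.

    q_values:  [batch, num_actions] critic estimates Q(s, a) for every action.
    log_probs: [batch, num_actions] log pi(a | s) from the categorical policy.
    alpha:     entropy temperature.
    """
    probs = tf.exp(log_probs)
    # The expectation over actions is taken exactly using the policy
    # probabilities, so no sampling or reparameterization is needed.
    per_state = tf.reduce_sum(probs * (alpha * log_probs - q_values), axis=-1)
    return tf.reduce_mean(per_state)
```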
-
I find that the network card mentioned is not discreet and takes up too much space in my bag during my scouting trips.
I am looking for a viable solution; what would you advise?
-
I am implementing a Soft Actor-Critic (SAC) agent and need to evaluate the Q-value network inside my custom environment (for the implementation of a special algorithm, called Wolpertinger's algorithm, to ha…
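For anyone looking at the same thing, here is a minimal sketch of the Wolpertinger-style refinement step, i.e. evaluating the Q network on the k discrete actions nearest to the actor's proto-action (NumPy; every name below is a placeholder, not from an existing library):

```python
import numpy as np

def wolpertinger_action(proto_action, action_embeddings, q_fn, state, k=10):
    """Refine a continuous proto-action into a discrete action.

    proto_action:      continuous action proposed by the actor.
    action_embeddings: [num_actions, dim] embedding of every discrete action.
    q_fn:              callable (state, action_embedding) -> scalar Q estimate.
    k:                 number of nearest neighbours to score with the critic.
    """
    # Find the k discrete actions closest to the proto-action.
    dists = np.linalg.norm(action_embeddings - proto_action, axis=1)
    candidates = np.argsort(dists)[:k]
    # Evaluate each candidate with the Q network and keep the best one.
    q_values = [q_fn(state, action_embeddings[i]) for i in candidates]
    return int(candidates[int(np.argmax(q_values))])
```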