-
## Fix the model test for `soft_actor_critic.py`
1. Set up the environment according to [Run a model under torch_xla2](https://github.com/pytorch/xla/blob/master/experimental/torch_xla2/docs/support_a_new_model…
-
Our current baseline RL algorithm is DQN (more accurately, DDQN). This algorithm uses epsilon-greedy policies so that it retains at least some chance of fully exploring the environment in question. Using epsi…
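For context, a minimal sketch of the epsilon-greedy action selection mentioned above, as typically used with DQN/DDQN (the names `q_network` and `num_actions` are illustrative placeholders, not from this codebase):

```python
import random
import torch

def epsilon_greedy_action(q_network, state, epsilon, num_actions):
    """Pick a uniformly random action with probability epsilon, else the greedy one."""
    if random.random() < epsilon:
        return random.randrange(num_actions)      # explore
    with torch.no_grad():
        q_values = q_network(state.unsqueeze(0))  # shape: (1, num_actions)
    return int(q_values.argmax(dim=1).item())     # exploit
```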
-
## Overview
Implement Soft Actor-Critic
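A minimal sketch of the core SAC objectives such an implementation would involve (all names are illustrative assumptions: `actor.sample` is a stochastic policy returning an action and its log-probability, and `critic1`/`critic2` are twin Q-networks):

```python
import torch
import torch.nn.functional as F

def sac_losses(batch, actor, critic1, critic2,
               target_critic1, target_critic2, alpha, gamma=0.99):
    state, action, reward, next_state, done = batch

    # Soft Bellman target: min of the twin target critics minus the entropy term.
    with torch.no_grad():
        next_action, next_log_prob = actor.sample(next_state)
        target_q = torch.min(target_critic1(next_state, next_action),
                             target_critic2(next_state, next_action))
        target = reward + gamma * (1.0 - done) * (target_q - alpha * next_log_prob)

    critic_loss = (F.mse_loss(critic1(state, action), target)
                   + F.mse_loss(critic2(state, action), target))

    # Actor maximizes the entropy-regularized Q-value of a reparameterized sample.
    new_action, log_prob = actor.sample(state)
    q_new = torch.min(critic1(state, new_action), critic2(state, new_action))
    actor_loss = (alpha * log_prob - q_new).mean()

    return critic_loss, actor_loss
```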
-
# [Reinforcement Learning] Soft Actor-Critic Paper Review - 재야의 숨은 초보 (Hidden Beginner)
[Reinforcement Learning] Soft Actor-Critic Paper Review
[https://hiddenbeginner.github.io/rl/2022/11/06/sac.html](https://hiddenbeginner.github.io/rl/2022/11/06/sac.html)
-
Hi @AlexKuhnle, sorry to bother you; I would like to implement the SAC algorithm, and I'm wondering if you have any suggestions for that.
In particular, I have some doubts about the following:…
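The specific doubts are elided above; as general context, one point that commonly trips up SAC implementations is the log-probability correction for the tanh-squashed Gaussian policy. A minimal sketch, with all names illustrative:

```python
import torch
from torch.distributions import Normal

def sample_squashed_gaussian(mean, log_std):
    """Reparameterized sample from tanh(Normal), with the corrected log-prob."""
    std = log_std.exp()
    dist = Normal(mean, std)
    pre_tanh = dist.rsample()            # reparameterization trick
    action = torch.tanh(pre_tanh)
    # Change-of-variables correction: subtract log|d tanh(u)/du| per dimension.
    log_prob = dist.log_prob(pre_tanh) - torch.log(1 - action.pow(2) + 1e-6)
    return action, log_prob.sum(dim=-1)
```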
-
# Actor-Critic Algorithms #
- Authors: Vijay R. Konda, John N. Tsitsiklis
- Origin: https://papers.nips.cc/paper/1786-actor-critic-algorithms.pdf
- Related:
- PyTorch4 tutorial of: actor critic…
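As a companion to the paper, a minimal one-step actor-critic update in PyTorch (illustrative only; the paper itself analyzes actor-critic with linear function approximation and TD(λ), not deep networks):

```python
import torch

def actor_critic_step(value_fn, optimizer, action_log_prob,
                      state, reward, next_state, done, gamma=0.99):
    """One-step TD actor-critic: the critic's TD error scales the policy-gradient term."""
    with torch.no_grad():
        target = reward + gamma * (1.0 - done) * value_fn(next_state)
    value = value_fn(state)
    td_error = target - value

    critic_loss = td_error.pow(2).mean()
    # Detach the TD error so the actor loss does not backprop through the critic.
    actor_loss = -(action_log_prob * td_error.detach()).mean()

    optimizer.zero_grad()
    (critic_loss + actor_loss).backward()
    optimizer.step()
```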
-
@Kismuz,
I believe I have encountered a framework (A3C) limitation.
While training a few of my recent models, I noticed strange behavior: for the first part of training everything seems to work fi…
-
In `obj_alpha = (self.alpha_log * (self.target_entropy - log_prob).detach()).mean()`, when `alpha_log = 0`, alpha will be 1 forever.
The correct way is `obj_alpha = (self.alpha * (self.target_entropy - log…
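A minimal standalone sketch of the two temperature-loss variants being contrasted here, assuming the common parameterization `alpha = alpha_log.exp()` (values are placeholders):

```python
import torch

alpha_log = torch.zeros(1, requires_grad=True)  # learnable log-temperature
target_entropy = -1.0                           # e.g. -dim(action_space)
log_prob = torch.tensor([-0.5, -1.5])           # placeholder action log-probs

# Variant 1 (as quoted): gradient w.r.t. alpha_log is
# (target_entropy - log_prob).mean(), independent of the current alpha.
obj_alpha_v1 = (alpha_log * (target_entropy - log_prob).detach()).mean()

# Variant 2 (proposed fix): multiply by alpha = exp(alpha_log) instead,
# so the gradient is scaled by the current temperature value.
alpha = alpha_log.exp()
obj_alpha_v2 = (alpha * (target_entropy - log_prob).detach()).mean()
```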
-
Thanks for sharing your code; it's great to be able to go through the implementation.
Maybe I'm misunderstanding this, but it seems that if you intend `self.cpc_optimizer` to only optimise `W`, then
…
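For reference, a minimal sketch of an optimizer scoped to a single parameter, assuming `W` is the bilinear matrix used in a CPC-style score (shapes and names are illustrative). An optimizer constructed this way will only step `W`, but `backward()` still writes gradients into any other parameters in the graph unless their outputs are detached before computing the contrastive loss:

```python
import torch

feature_dim = 50  # illustrative latent dimension
W = torch.nn.Parameter(torch.rand(feature_dim, feature_dim))

# Passing only [W] means .step() updates W alone...
cpc_optimizer = torch.optim.Adam([W], lr=1e-3)

# ...but loss.backward() still accumulates .grad on the encoder's parameters
# unless the encoder outputs are detached before the CPC loss is computed.
```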