-
https://wechatscope.jmsc.hku.hk/api/html?fn=gh_14b07ab30393_2022-04-06_2247569371_xDoErwIui4.y.tar.gz
-
https://wechatscope.jmsc.hku.hk/api/html?fn=gh_14b07ab30393_2022-06-13_2247573711_IGbB26mhdX.y.tar.gz
-
Hi, thank you for your response to the last issue; it works.
I wonder whether I could use other DRL algorithms (e.g., Soft Actor-Critic, SAC) to implement it.
I saw the option "# Agent type: SAC o…
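If the underlying agents are Stable-Baselines3 ones (an assumption here; the excerpt doesn't say which library backs that option), switching to SAC could look roughly like this minimal sketch; the environment id and timestep budget are placeholders:

```python
# Hedged sketch: training an SB3 SAC agent. The environment id and timestep
# budget are placeholders, not settings taken from the project in question.
from stable_baselines3 import SAC

model = SAC("MlpPolicy", "Pendulum-v1", verbose=1)  # SAC needs a continuous action space
model.learn(total_timesteps=100_000)
model.save("sac_agent")
```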
-
**This issue will be kept updated, as the list of changes is not exhaustive.**
Dear all,
Stable-Baselines3 beta is now out :tada:! This issue is meant to reference what is implemented and what …
-
**Describe the bug**
In Stable Baselines, if I train `sac.SAC` with `tensorboard_log='./logs/'`, I get a TensorBoard log in `./logs/SAC_1/`. But in Stable Baselines 3, with the same keyword argument,…
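A minimal repro sketch of the setup described above, for the Stable Baselines 3 side; the environment id and step count are placeholder choices:

```python
# Minimal repro sketch (Stable-Baselines3). Environment and step count are placeholders.
from stable_baselines3 import SAC

model = SAC("MlpPolicy", "Pendulum-v1", tensorboard_log="./logs/")
model.learn(total_timesteps=10_000)
# In Stable Baselines (2), the equivalent run wrote event files under ./logs/SAC_1/;
# the truncated report above is about where SB3 writes them given the same kwarg.
```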
-
Upon reading the [`sac_impl.hpp`](https://github.com/mlpack/mlpack/blob/master/src/mlpack/methods/reinforcement_learning/sac_impl.hpp), I realized that it's not an implementation of Soft Actor Critic …
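For context (not from the issue text): the defining feature of Soft Actor-Critic is the entropy-regularized objective from Haarnoja et al. (2018), which any faithful implementation's critic and actor updates have to reflect:

```latex
% Entropy-regularized SAC objective; \alpha is the temperature coefficient.
J(\pi) = \sum_{t} \mathbb{E}_{(s_t, a_t) \sim \rho_\pi}
         \Big[ r(s_t, a_t) + \alpha \, \mathcal{H}\big(\pi(\cdot \mid s_t)\big) \Big]
```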
-
Since I've ditched rl-agents in #1, I might as well implement this one myself too.
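A rough sketch of what one SAC update step involves, assuming PyTorch and hypothetical `actor`, `critic1`/`critic2`, and target-network modules (none of these names come from the repo); twin critics and a fixed temperature are assumed, with no automatic entropy tuning:

```python
import torch
import torch.nn.functional as F

# Rough sketch of a single SAC update. All module and optimizer names are
# hypothetical; alpha is a fixed entropy temperature.
def sac_update(batch, actor, critic1, critic2, target1, target2,
               actor_opt, critic_opt, alpha=0.2, gamma=0.99):
    obs, act, rew, next_obs, done = batch

    # Soft Bellman target: twin-critic minimum plus an entropy bonus on the next action.
    with torch.no_grad():
        next_act, next_logp = actor.sample(next_obs)   # reparameterized action + log-prob
        next_q = torch.min(target1(next_obs, next_act), target2(next_obs, next_act))
        target = rew + gamma * (1.0 - done) * (next_q - alpha * next_logp)

    # Critics regress toward the shared soft target.
    critic_loss = F.mse_loss(critic1(obs, act), target) + F.mse_loss(critic2(obs, act), target)
    critic_opt.zero_grad(); critic_loss.backward(); critic_opt.step()

    # Actor minimizes alpha * log-prob minus Q, i.e. maximizes the entropy-regularized value.
    new_act, logp = actor.sample(obs)
    q = torch.min(critic1(obs, new_act), critic2(obs, new_act))
    actor_loss = (alpha * logp - q).mean()
    actor_opt.zero_grad(); actor_loss.backward(); actor_opt.step()

    return critic_loss.item(), actor_loss.item()
```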
-
Time is really limited right now, so we figured we'd test the Dueling implementation against REINFORCE and a soft actor-critic agent. This way we get a Q-value-based model, a policy-gradient method, and an a…
-
Hi.
I was wondering whether you have done training for more time steps than 100k or with other hyperparameters?
Unfortunately, I don't have a GPU yet, so I cannot run experiments myself.
The reports in…