soft-actor-critic Search Results

384 results
for soft-actor-critic

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

apourchot/CEM-RL #1

Have you tried CEM on soft actor critic alg? (SAC)

Hi, thanks for releasing your code ! Btw, have you by any chance tried implementing your CEM method on SAC algorithm? thanks !

tldoan updated 5 years ago
3
ray-project/ray #8394

[raysgd] How to set spot workers for an Azure cluster?

In Azure, we should be able to set a node as spot (preemptible) by setting [properties.priority = 'Spot'](https://docs.microsoft.com/en-us/rest/api/compute/virtualmachines/createorupdate#virtualmachin…

AndreCNF updated 4 years ago
11
araffin/rl-baselines-zoo #18

Google colab error for Soft actor critic

I am running the rl-baselines-zoo for humanoid bullet in google colab. At first I ran it with ppo2 and it gave a very good result with rewards going upto 1600. Now I am running the Softactor critic an…

testerpce updated 5 years ago
7
astooke/rlpyt #118

Positive Log Likelyhood from Guassian

I'm running a soft actor-critic algorithm, and my alpha value is going to infinity. I've traced the bug back to the fact that I am getting positive values of log_pi. This means there is probably a bug…

jordan-schneider updated 4 years ago
2
facebookresearch/ReAgent #84

More supported models?

Dear authors, Great work for the excellent. Below are the lists of supported models, which we think some other more methods are also crucial for some applications. Discrete-Action DQN Parametric…

JunchenJin updated 5 years ago
3
rail-berkeley/rlkit #48

Make sure you dont need a Mujoco license to use any of the a…

There are many algorithms that import Mujoco environments because they are not separated. In my case I dont care about Mujoco, in fact I had to get a trial license just to avoid having to remove code …

redknightlois updated 5 years ago
5
hill-a/stable-baselines #276

[Question] Algorithmic differences between stable-baselines …

When I am conducting experiments and find that some approach works well, I want to compare my results to an established baseline. OpenAI Baselines is one, but (for obvious reasons listed in the readme…

dniku updated 5 years ago
1
pranz24/pytorch-soft-actor-critic #4

reproducibility for HalfCheetah-v2

Hi, I ran your code by just setting timestep to 3 millions like in the official paper (the other parameters were let by default like in your code). I couldn't reproduce the 15,000 result of the pap…

tldoan updated 5 years ago
5
samialabed/rlcache #3

[Agent]TTL Estimation

Consider using Soft actor critic for TTL estimation.

samialabed updated 5 years ago
2
rlgraph/rlgraph #35

[Algorithm] Implement soft-actor-critic

Seems a good candidate for inclusion: https://arxiv.org/abs/1801.01290 Applications: https://arxiv.org/abs/1812.05905

michaelschaarschmidt updated 5 years ago
1

上一页 1...33 34 35 36 37 38 39...39 下一页

384 results for soft-actor-critic

384 results
for soft-actor-critic