-
Dear,
The bug is hard to reproduce because it is caused by numerical issues that occur only when the underlying neural network learns parameters that are too large for a DiagGaussian action distribu…
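One common way such numerical issues are avoided (a hypothetical sketch, not this project's actual code) is to clamp the learned `log_std` into a safe range before exponentiating it into a standard deviation, so the DiagGaussian density can neither overflow nor collapse:

```python
import math

# Common clamp range, e.g. used in several SAC implementations; the exact
# bounds here are illustrative assumptions, not values from this repo.
LOG_STD_MIN, LOG_STD_MAX = -20.0, 2.0

def clamp_log_std(log_std: float) -> float:
    """Clamp log_std into a numerically safe range before exp()."""
    return max(LOG_STD_MIN, min(LOG_STD_MAX, log_std))

# A huge learned log_std would make exp() overflow; clamped it stays finite.
print(clamp_log_std(50.0))            # 2.0
print(math.exp(clamp_log_std(50.0)))  # std stays finite
```

With the clamp in place, even a network that drifts toward extreme parameter values keeps the action distribution well-defined.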
-
This is definitely a nice-to-have -- it'd be useful if, as part of snapshots, we could have the option to record the policy being rolled out deterministically for a few rollouts, so that we can watch i…
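The requested feature could be sketched roughly as follows (all names here are hypothetical, and a minimal stand-in env/policy is used so the sketch runs on its own): at snapshot time, roll the policy out with its deterministic (mean) action and keep the trajectory for later viewing.

```python
def record_deterministic_rollout(env, policy, max_steps=200):
    """Run one rollout taking the policy's deterministic action each step."""
    frames = []
    obs = env.reset()
    for _ in range(max_steps):
        action = policy(obs, deterministic=True)  # mean action, no sampling
        obs, reward, done = env.step(action)
        frames.append((obs, action, reward))
        if done:
            break
    return frames

# Minimal stand-in env/policy so the sketch is self-contained:
class CountEnv:
    def reset(self):
        self.t = 0
        return 0
    def step(self, action):
        self.t += 1
        return self.t, 1.0, self.t >= 5

policy = lambda obs, deterministic=True: 0
frames = record_deterministic_rollout(CountEnv(), policy)
print(len(frames))  # 5
```

In a real snapshot hook, `frames` would instead hold rendered images written out as a video next to the checkpoint.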
-
Out of curiosity, today I looked into hyper-parameter tuning. For this, the Optuna package seems like the way to go. In my first pass, I adapted some of the code that stable-baselines used for tuning i…
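The core loop Optuna automates looks roughly like this; the sketch below is a stdlib stand-in (the function names and search ranges are illustrative assumptions, not Optuna's API): each trial suggests hyperparameters, an objective scores them, and the best trial is kept.

```python
import math
import random

def suggest_params(rng):
    """Suggest one hyperparameter set (log-uniform lr, uniform gamma)."""
    return {
        "learning_rate": 10 ** rng.uniform(-5, -2),
        "gamma": rng.uniform(0.9, 0.9999),
    }

def objective(params):
    # Placeholder for "train the agent, return mean episode reward".
    # Here: a toy score that peaks near lr=1e-3, gamma=0.99.
    return (-((math.log10(params["learning_rate"]) + 3) ** 2)
            - (params["gamma"] - 0.99) ** 2)

rng = random.Random(0)
best = max((suggest_params(rng) for _ in range(50)), key=objective)
print(best)
```

Optuna replaces the random search above with smarter samplers (TPE by default) and adds pruning of unpromising trials, which matters when each trial is a full training run.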
-
Hello,
I've tried in vain to find suitable hyperparameters for SAC in order to solve MountainCarContinuous-v0.
Even with hyperparameter tuning (see "add-trpo" branch of [rl baselines zoo](https:…
-
I am curious where the `calculate_gaussian_log_prob(log_std, noise)` function in your utils.py came from. It doesn't look like the stable-baselines or PyTorch log PDF of the normal distribution. So what…
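One plausible reading (a hypothetical reconstruction, not the repo's actual code): if the sample was produced via the reparameterization trick, `x = mean + exp(log_std) * noise`, then substituting `x` into the normal log-pdf makes the mean cancel, leaving a formula in `log_std` and `noise` alone, per dimension: `log p(x) = -0.5 * noise**2 - log_std - 0.5 * log(2*pi)`. A sketch with a cross-check against the direct log-pdf:

```python
import math

def calculate_gaussian_log_prob(log_std, noise):
    """Gaussian log-prob of x = mean + exp(log_std)*noise, summed over dims."""
    return sum(
        -0.5 * n * n - ls - 0.5 * math.log(2.0 * math.pi)
        for ls, n in zip(log_std, noise)
    )

def normal_log_pdf(x, mean, std):
    """Direct log-pdf of N(mean, std**2) for comparison."""
    return (-0.5 * ((x - mean) / std) ** 2
            - math.log(std) - 0.5 * math.log(2.0 * math.pi))

# Cross-check for one dimension with mean = 1.0:
log_std, noise = [0.3], [0.7]
x = 1.0 + math.exp(0.3) * 0.7
print(calculate_gaussian_log_prob(log_std, noise))
print(normal_log_pdf(x, 1.0, math.exp(0.3)))  # same value
```

So the function may simply be the standard normal log-pdf rewritten in terms of the sampled noise, which avoids recomputing `(x - mean) / std`.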
-
Implement the main agent with Trust Region Policy Optimization (TRPO, see [Link](https://arxiv.org/abs/1502.05477))
- [x] Set up InvertedPendulum environment in OpenAI Gym
- [x] Set up neural net an…
-
In a first test, I created a new venv, then ran `pip install -r requirements.txt`,
then `python ./train_dqn_agent.py`.
I get:
`ModuleNotFoundError: No module named 'tensorflow.contrib'`
Does this mean only…
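For context: `tensorflow.contrib` was removed in TensorFlow 2.0, so this error typically means the installed TensorFlow is 2.x while the code targets the 1.x API. Assuming the project indeed requires TF 1.x, a requirements pin like the following would avoid it (the exact lower bound is an assumption):

```
# requirements.txt pin for code that imports tensorflow.contrib (TF 1.x only)
tensorflow>=1.14,<2.0
```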
-
Hi, I installed the plugin, including the code environment, successfully. However, when I run it I get the following error:
[14:39:43] [INFO] [dku.utils] - *************** Recipe code failed **************
[14…
-
Take rllib / stable baselines / another library with algorithms and add code to the repo that runs some PPO or A2C on the CartPole environment from gym. The next step is to check how, in this librar…
-
**Describe the question**
As far as I understand, when using a GPU, `SubprocVecEnv` runs multiple workers, each running its own environment on a GPU, and then updates the model when it has gathered a…