-
Hello,
I have some trouble understanding `UnVecWrapper` when training with SAC, using the scripts provided in [Stable Baselines Zoo](https://github.com/araffin/rl-baselines-zoo):
```bash
python t…
```
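For context, here is a minimal sketch of what I believe is going on (not the zoo script itself; `Pendulum-v0` and the hyperparameters are placeholders I chose): SAC only operates on a single environment, so the zoo wraps it in a one-env `DummyVecEnv`, and as far as I can tell `UnVecWrapper` is the internal helper that unwraps that single env again.
```python
# Minimal sketch (assumed setup, not the actual zoo script):
import gym
from stable_baselines import SAC
from stable_baselines.common.vec_env import DummyVecEnv

# SAC expects a single env; passing it as a one-env DummyVecEnv works
# because the model unwraps it internally (where UnVecWrapper appears).
env = DummyVecEnv([lambda: gym.make('Pendulum-v0')])
model = SAC('MlpPolicy', env, verbose=1)
model.learn(total_timesteps=1000)
```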
-
Hello. My objective is to imitate an expert using GAIL. I train an agent on a gym env with an RL algo (say TRPO), generate expert data from the learned model (using [``generate_expert_traj``](https://stable-basel…
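For reference, a minimal sketch of the pipeline I'm describing, pieced together from the docs (`Pendulum-v0`, the file name and the timestep counts are placeholders, not my actual setup):
```python
import gym
from stable_baselines import TRPO, GAIL
from stable_baselines.gail import generate_expert_traj, ExpertDataset

# 1. Train an expert with an RL algorithm (TRPO here).
expert = TRPO('MlpPolicy', 'Pendulum-v0', verbose=1)
expert.learn(total_timesteps=10000)

# 2. Record expert trajectories to an .npz file.
generate_expert_traj(expert, 'expert_pendulum', n_episodes=10)

# 3. Train GAIL on the recorded dataset.
dataset = ExpertDataset(expert_path='expert_pendulum.npz', verbose=1)
model = GAIL('MlpPolicy', 'Pendulum-v0', dataset, verbose=1)
model.learn(total_timesteps=10000)
```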
-
## Issue description
Presently, under Julia 1.0.1, PyCall (a dependency of PyPlot) fails to build with the error:
```julia
julia> Pkg.build("PyCall")
Building Conda ─→ `~/.julia/packages/Conda/uQitS/…
```
-
Hi all,
I can't get the same results twice with TD3, but the same setup is reproducible with PPO2.
```python
import gym
from stable_baselines import TD3
from stable_baselines.td3.policies import MlpPolicy
env = gy…
```
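Since the snippet above is cut off, here is a fuller seeding sketch of what I tried (the `set_global_seeds` call and the explicit `action_space.seed` are my additions; I'm not sure all of them are required):
```python
import gym
from stable_baselines import TD3
from stable_baselines.td3.policies import MlpPolicy
from stable_baselines.common import set_global_seeds

set_global_seeds(0)           # seeds python's random, numpy and tensorflow
env = gym.make('Pendulum-v0')
env.seed(0)                   # seed the environment itself
env.action_space.seed(0)      # exploration noise may sample from here
model = TD3(MlpPolicy, env, verbose=0)
model.learn(total_timesteps=5000)
```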
-
I'm training an agent on a custom environment using SAC. The environment is wrapped in a `Monitor`, which is wrapped in a `DummyVecEnv`, which is wrapped in a `VecNormalize`, with `norm_reward = True`…
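For clarity, the wrapping order looks like this (a sketch with a placeholder env; my actual environment is custom):
```python
import gym
from stable_baselines import SAC
from stable_baselines.bench import Monitor
from stable_baselines.common.vec_env import DummyVecEnv, VecNormalize

def make_env():
    # Monitor first, so it records the raw (unnormalized) episode rewards
    return Monitor(gym.make('Pendulum-v0'), filename=None)

venv = DummyVecEnv([make_env])
venv = VecNormalize(venv, norm_obs=True, norm_reward=True)
model = SAC('MlpPolicy', venv, verbose=1)
model.learn(total_timesteps=10000)
```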
-
This is possibly quite trivial, but the `env.render()` lines of the code examples do not run for multiprocess environments unless `n_envs=1` is passed to `make_vec_env()`.
For example, the code exampl…
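A minimal sketch of what I mean (`CartPole-v1` and PPO2 are placeholders; the failure only appears once `n_envs > 1`):
```python
from stable_baselines import PPO2
from stable_baselines.common.cmd_util import make_vec_env

env = make_vec_env('CartPole-v1', n_envs=1)  # rendering works; with n_envs=4 it does not
model = PPO2('MlpPolicy', env, verbose=0)
model.learn(total_timesteps=1000)

obs = env.reset()
for _ in range(100):
    action, _states = model.predict(obs)
    obs, rewards, dones, info = env.step(action)
    env.render()
```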
-
doesn't work...
-
I am trying to save the GAIL model during training. I'm using the example code from the [Callback](https://stable-baselines.readthedocs.io/en/master/guide/callbacks.html#checkpointcallback) documentat…
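For reference, my setup looks roughly like this (adapted from the docs; the dataset path, save path and frequencies are placeholders):
```python
from stable_baselines import GAIL
from stable_baselines.gail import ExpertDataset
from stable_baselines.common.callbacks import CheckpointCallback

dataset = ExpertDataset(expert_path='expert_pendulum.npz', verbose=1)
model = GAIL('MlpPolicy', 'Pendulum-v0', dataset, verbose=1)

# Save a checkpoint every 1000 steps
checkpoint_callback = CheckpointCallback(save_freq=1000, save_path='./logs/',
                                         name_prefix='gail_model')
model.learn(total_timesteps=10000, callback=checkpoint_callback)
```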
-
I have tried running the Pendulum-v0 animation with several different scripts, but the SAC example code from the stable_baselines docs only shows me the first frame of the pendulum render window and does not animate.
He…
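Since my code is cut off above, here is the kind of render loop I would expect to animate continuously (a sketch; the timestep counts are arbitrary):
```python
import gym
from stable_baselines import SAC

env = gym.make('Pendulum-v0')
model = SAC('MlpPolicy', env, verbose=0)
model.learn(total_timesteps=1000)

obs = env.reset()
for _ in range(200):
    action, _states = model.predict(obs, deterministic=True)
    obs, reward, done, info = env.step(action)
    env.render()  # must be called every step to animate
    if done:
        obs = env.reset()
env.close()
```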
-
Hi!
Currently the reward function for Pendulum-v0 is:
`-(theta^2 + 0.1*theta_dt^2 + 0.001*action^2)`
Since every term is squared, the signs of theta, theta_dt and action are not considered. If the angle < 0 and torque > 0 …
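A tiny numeric check of the point (plain Python, values chosen arbitrarily): because every term is squared, the reward is identical for theta and -theta, so the relative sign of angle and torque never enters the cost.
```python
def reward(theta, theta_dt, action):
    # the current Pendulum-v0 cost, as quoted above
    return -(theta ** 2 + 0.1 * theta_dt ** 2 + 0.001 * action ** 2)

print(reward(0.5, 0.1, 1.0))   # -0.252
print(reward(-0.5, 0.1, 1.0))  # -0.252, same despite the opposite angle sign
```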