-
Hello!
There is a limit on the problem size above which a crash (segmentation fault) occurs while it is trying to solve. I created a (not so) minimal example, but it is not the only one that cau…
-
hi
My reward stays at 300 after 1M (100w) steps whereas yours increases almost linearly. My actor loss is between -4.5 and -0.5 while my critic loss is between 0 and 0.06, which is much smaller …
-
**Describe the bug**
Loading a DDPG agent that was trained with normalized observations or normalized returns does not work. The loaded agent does not have the correct critic or the correct policy. Thi…
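For context, the usual workaround is to persist the normalization statistics alongside the model and restore both at load time; below is a minimal sketch, assuming stable-baselines' VecNormalize with the `save`/`load` helpers available in recent 2.x releases (method names differ in older versions, and the file names are placeholders):
```python
# Sketch: save/restore VecNormalize statistics together with the DDPG model.
import gym
from stable_baselines import DDPG
from stable_baselines.common.vec_env import DummyVecEnv, VecNormalize

env = VecNormalize(DummyVecEnv([lambda: gym.make("Pendulum-v0")]))
model = DDPG("MlpPolicy", env, verbose=0)
model.learn(total_timesteps=1000)

model.save("ddpg_pendulum")
env.save("vec_normalize.pkl")  # running obs/return statistics

# Later, in a fresh process:
eval_env = DummyVecEnv([lambda: gym.make("Pendulum-v0")])
eval_env = VecNormalize.load("vec_normalize.pkl", eval_env)
eval_env.training = False      # freeze statistics at evaluation time
eval_env.norm_reward = False
model = DDPG.load("ddpg_pendulum", env=eval_env)
```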
-
**Describe the bug**
ModuleNotFoundError: No module named 'stable_baselines.ddpg.memory', when loading the DDPG Pendulum-v0 agent
**Code example**
```python
from utils import ALGOS
folder = "trained_agen…
-
When I try to run the example DDPG test and eval scripts, I get the following issue:
`module 'tensorflow' has no attribute 'enable_resource_variables'`
If I change this line to:
`tf.compat.v1.enable…`
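For what it's worth, a small sketch of a version-agnostic guard for this call (the exact call site inside the example script is an assumption on my part):
```python
# Sketch of a TF1/TF2-compatible guard around enable_resource_variables.
import tensorflow as tf

if hasattr(tf, "enable_resource_variables"):
    tf.enable_resource_variables()            # TF 1.x exposes it at top level
else:
    tf.compat.v1.enable_resource_variables()  # TF 2.x only has it under compat.v1
```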
-
Hi, I've been working with SAC up to now and it works fine with my custom environment. However, when I wanted to test some other algorithms, I ran into the following error, which I don't really have any ide…
-
I'd like to implement Hindsight Experience Replay (HER). This can be built on top of any goal-parameterized off-policy RL algorithm.
**Goal-parameterized architectures**: this requires a variable for…
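To illustrate the goal-parameterized idea (the dict-observation layout and key names below are assumptions, not the proposal's actual interface): the policy and value function take the goal as an extra input, usually by concatenating it to the observation, and HER then relabels stored transitions with goals achieved later in the episode.
```python
# Minimal sketch of a goal-conditioned input, assuming gym-style dict observations.
import numpy as np

def goal_conditioned_input(obs_dict):
    """Concatenate the raw observation with the desired goal so any off-policy
    algorithm (DDPG, SAC, TD3, DQN, ...) can condition on it."""
    return np.concatenate(
        [obs_dict["observation"], obs_dict["desired_goal"]], axis=-1
    )
```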
-
I'm training a PGTrainer with PyTorch and tune.run, using the following command:
```python
tune.run(pg.PGTrainer,
         local_dir=".", stop={"episode_reward_mean": 0.5},
         resourc…
```
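For comparison, here is a runnable sketch of a similar call; the environment, config values, and the `ray.rllib.agents` import path are assumptions (and note that RLlib trainers usually take their resources from the config, e.g. `num_workers`/`num_gpus`, rather than from `resources_per_trial`):
```python
# Sketch of a torch-backed PG run via tune.run; env and settings are assumptions.
import ray
from ray import tune
from ray.rllib.agents import pg

ray.init()
tune.run(
    pg.PGTrainer,
    local_dir=".",
    stop={"episode_reward_mean": 0.5},
    config={
        "env": "CartPole-v0",
        "framework": "torch",  # older RLlib versions use "use_pytorch": True instead
        "num_workers": 1,
    },
)
```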
-
I wanted to propose adding a pendulum experiment to bsuite. I think it fits the targeted, simple, challenging, scalable, fast criteria outlined in the bsuite paper. Also, now that https://github.com…
-
Hi,
I find it difficult to figure out working parameters for SAC. Are there some standard examples, e.g. as in the original softlearning environments?
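For what it's worth (not an authoritative answer), the values below are in the range commonly used for SAC on continuous-control tasks; the stable-baselines API shown and the exact numbers are assumptions to be tuned per environment:
```python
# Sketch of a SAC setup with commonly used hyperparameter ranges; adjust per task.
import gym
from stable_baselines import SAC

env = gym.make("Pendulum-v0")
model = SAC(
    "MlpPolicy", env,
    learning_rate=3e-4,     # typical Adam step size for continuous control
    buffer_size=1_000_000,  # large replay buffer
    batch_size=256,
    tau=0.005,              # soft target-update rate
    gamma=0.99,
    ent_coef="auto",        # automatic entropy temperature tuning
    train_freq=1,
    gradient_steps=1,
    learning_starts=10_000,
    verbose=1,
)
model.learn(total_timesteps=100_000)
```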