-
### System information
- **OS Platform and Distribution (e.g., Linux Ubuntu 16.04)**: Ubuntu 16.04.6 LTS (GNU/Linux 4.4.0-143-generic x86_64)
- **Ray installed from (source or binary)**: pip inst…
-
### System information
- **OS Platform and Distribution (e.g., Linux Ubuntu 16.04)**: macOS Mojave
- **Ray installed from (source or binary)**: binary
- **Ray version**: 0.8.0.dev1
- **Python ve…
-
First of all, thanks a lot for this awesome project. Stable-Baselines helps me a lot!
I am trying to get a GAIL agent going, to gain experience with Inverse Reinforcement Learning in combination with Gym and/…
-
In stable-baselines/ddpg/ddpg.py, lines 916 and 918: should `eval/return` and `eval/Q` be wrapped in `np.mean` to make them scalars?
```python
# Evaluation statistics.
if self.eval_env is not None:
    co…
```
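For context, a small sketch (not the library's actual code) of why `np.mean` matters here: evaluation returns and Q-values are collected per episode/step, so logging the buffers directly yields a sequence rather than a scalar. The variable names below are illustrative stand-ins.

```python
import numpy as np

# Hypothetical stand-ins for the evaluation buffers accumulated in DDPG;
# the real code fills these over evaluation rollouts.
eval_episode_rewards = [10.0, 12.0, 8.0]   # one return per eval episode
eval_qs = [[0.5, 0.7], [0.6, 0.4]]         # Q-values per step, per episode

# Logged directly these are sequences; most loggers expect scalars.
stat_scalar = np.mean(eval_episode_rewards)  # 10.0, a single float
q_scalar = np.mean(eval_qs)                  # mean over all Q-values
print(stat_scalar, q_scalar)
```

Whether the fix belongs exactly at lines 916/918 depends on how the surrounding logging code treats these buffers.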
-
Is there any reason why we save the entire model rather than its `state_dict()`? Also, why do we create a CPU copy of the CUDA actor-critic network before saving it (line: 152, copy.deepcopy(acto…
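For reference, a minimal sketch of the two saving styles in question (a toy `nn.Linear` stands in for the actor-critic; this is not the repo's actual code). PyTorch's own docs recommend saving the `state_dict()`, since pickling the whole module ties the checkpoint to the exact class and file layout. The CPU copy is presumably so the checkpoint loads on machines without a GPU, though `torch.load(..., map_location='cpu')` achieves the same at load time.

```python
import copy
import torch
import torch.nn as nn

actor_critic = nn.Linear(4, 2)  # toy stand-in for the actor-critic network

# Style 1: save the whole module (pickles the class itself; brittle
# across refactors, since loading needs the same class importable).
torch.save(actor_critic, 'model_full.pt')

# Style 2: save only the parameters (the PyTorch-recommended approach).
torch.save(actor_critic.state_dict(), 'model_state.pt')

# CPU copy before saving (here the module is already on CPU; in the repo
# it would be on CUDA). An alternative is map_location='cpu' at load time.
cpu_copy = copy.deepcopy(actor_critic).to('cpu')
torch.save(cpu_copy.state_dict(), 'model_cpu.pt')
```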
-
I am testing my actor-critic (AC) code with the Pendulum-v0 environment and got this error:

```
gym\envs\classic_control\pendulum.py:88: RuntimeWarning: invalid value encountered in remainder
  return (((x+np.pi) % (2…
```
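This warning usually means the angle `x` is already NaN by the time it reaches Pendulum's `angle_normalize` (for example, a NaN action from a diverging network propagating into the state). A minimal reproduction, assuming the standard `angle_normalize` expression from Gym's pendulum source:

```python
import numpy as np

def angle_normalize(x):
    # Same expression as Gym's pendulum.py angle normalization
    return ((x + np.pi) % (2 * np.pi)) - np.pi

# A NaN angle (e.g. caused by a NaN action) triggers the warning:
x = np.array(np.nan)
with np.errstate(invalid='warn'):
    result = angle_normalize(x)  # RuntimeWarning: invalid value encountered in remainder
print(result)  # nan
```

So the warning is a symptom rather than the bug itself; the fix is usually to find where the policy starts emitting NaNs (exploding loss, too-large learning rate, unbounded std, etc.).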
-
```python
std_share_network=True,
hidden_sizes=(200, 200)
```

Observed behavior: the std explodes.
-
I want to know the significance of the **squeeze** operation (line: 162) in `a2c_ppo_acktr/envs.py`. The *squeeze* operation sends scalar values as **action_value** instead of singly-sized vectors for…
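Independent of the repo's actual code, here is what such a squeeze does to a length-1 action (variable names are illustrative): discrete Gym environments typically expect a scalar action, so a shape-(1,) vector is reduced to a 0-d value before being passed to the environment.

```python
import numpy as np

action = np.array([2])           # singly-sized vector, shape (1,)
squeezed = np.squeeze(action)    # 0-d array, shape ()

print(action.shape, squeezed.shape)  # (1,) ()

# Many discrete envs index into an action table, which works with a
# scalar like int(squeezed) but can fail or mis-broadcast with shape (1,).
```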
-
I would like to know why `getattr(get_vec_normalize(envs), 'ob_rms', None)` is saved along with the actor-critic network.
*lines: 154-155, main.py*
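A plausible reason, sketched below with a simplified RunningMeanStd (not the repo's exact class): VecNormalize keeps running mean/variance statistics of the observations, and a policy trained on normalized inputs only behaves correctly if the *same* statistics are applied when the model is reloaded, so `ob_rms` is checkpointed alongside the network.

```python
import numpy as np

# Sketch of why ob_rms matters: a policy trained on (obs - mean) / std
# sees garbage at evaluation time unless the same stats are restored.
class RunningMeanStd:
    """Simplified version of the stats object saved as 'ob_rms'."""
    def __init__(self, shape):
        self.mean = np.zeros(shape)
        self.var = np.ones(shape)
        self.count = 1e-4

    def update(self, batch):
        # Parallel mean/variance combination (Chan et al. style update).
        b_mean, b_var, b_count = batch.mean(axis=0), batch.var(axis=0), batch.shape[0]
        delta = b_mean - self.mean
        total = self.count + b_count
        self.mean = self.mean + delta * b_count / total
        m_a = self.var * self.count
        m_b = b_var * b_count
        self.var = (m_a + m_b + delta ** 2 * self.count * b_count / total) / total
        self.count = total

rng = np.random.default_rng(0)
rms = RunningMeanStd(shape=(3,))
rms.update(rng.normal(loc=10.0, scale=5.0, size=(10000, 3)))

# With the saved stats, a raw observation is mapped back into the
# roughly zero-mean, unit-variance range the policy was trained on.
obs = np.array([10.0, 10.0, 10.0])
normalized = (obs - rms.mean) / np.sqrt(rms.var + 1e-8)
```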