-
Deliverable: a plot.py, similar to what we had in the assignments, that takes a log directory generated by PPO and produces a learning curve.
This is going to involve transition from our in-class…
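A minimal sketch of what such a plot.py could look like. The log format is an assumption (a `progress.csv` with `step` and `episode_return` columns — both names hypothetical, not taken from the assignment code); the matplotlib import is deferred into `main` so the parsing helpers work headless:

```python
# Hypothetical plot.py sketch; log file name and column names are assumptions.
import csv
import os
import sys


def load_curve(log_dir, fname="progress.csv"):
    """Read (step, episode_return) pairs from a CSV log in log_dir."""
    steps, returns = [], []
    with open(os.path.join(log_dir, fname)) as f:
        for row in csv.DictReader(f):
            steps.append(int(float(row["step"])))
            returns.append(float(row["episode_return"]))
    return steps, returns


def smooth(values, window=10):
    """Trailing moving average to de-noise per-episode returns."""
    out = []
    for i in range(len(values)):
        lo = max(0, i - window + 1)
        out.append(sum(values[lo:i + 1]) / (i + 1 - lo))
    return out


def main(log_dir):
    import matplotlib.pyplot as plt  # deferred so parsing is testable headless
    steps, returns = load_curve(log_dir)
    plt.plot(steps, smooth(returns))
    plt.xlabel("environment steps")
    plt.ylabel("episode return")
    plt.savefig(os.path.join(log_dir, "learning_curve.png"))


if __name__ == "__main__":
    main(sys.argv[1])
```

Run as `python plot.py <log_dir>`; it writes `learning_curve.png` into the same directory.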
-
According to the DQN nature paper and [PPO1 implementation](https://github.com/openai/baselines/blob/ea25b9e8b234e6ee1bca43083f8f3cf974143998/baselines/ppo1/cnn_policy.py#L30), [this line](https://git…
-
---------------------------------------------------------------------------
RuntimeError Traceback (most recent call last)
Cell In[25], line 3
1 from elegantrl.tr…
-
### What happened + What you expected to happen
After training multi-agent PPO with the new API stack, following [how-to-use-the-new-api-stack](https://docs.ray.io/en/latest/rllib/rllib-n…
-
Dear author:
I ran this command:
python scripts/play.py --task=humanoid_ppo --run_name v1
But it doesn't seem to be running correctly.
Can you tell me how to solve this problem?
-
### Proposal
Currently, there are only 2 datasets for [discrete](https://gymnasium.farama.org/api/spaces/fundamental/#gymnasium.spaces.Discrete)-action envs:
- [Fourrooms](https://minari.farama.…
-
Hello there,
First, I'd like to express my appreciation for your excellent work on this project.
While experimenting with PPO/RW using this repository, I consistently encounter Out of Memory (OOM) e…
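One common source of OOM in PPO updates is pushing the entire rollout through the network in a single forward/backward pass; iterating over fixed-size minibatches bounds peak memory instead. A small sketch of the index-chunking idea, independent of this repository's actual code (function name is hypothetical):

```python
import random


def minibatch_indices(batch_size, minibatch_size, shuffle=True, seed=None):
    """Yield index lists covering a rollout of `batch_size` samples in
    chunks of at most `minibatch_size`, so only one chunk of activations
    is resident in accelerator memory at a time."""
    idx = list(range(batch_size))
    if shuffle:
        random.Random(seed).shuffle(idx)
    for start in range(0, batch_size, minibatch_size):
        yield idx[start:start + minibatch_size]
```

Each PPO epoch then loops over these chunks, computing the loss and stepping the optimizer per chunk rather than once over the full batch.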
-
Hi, what is the best way to implement action constraints in a PPOAgent?
For a `QPolicy` I can use `observation_and_action_constraint_splitter`. Is there something equivalent for PPO policies?
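I'm not aware of a built-in splitter for PPO policies; a common workaround is action masking: set the logits of invalid actions to a large negative value before sampling, so constrained actions get essentially zero probability. A pure-Python sketch of that idea (not tf-agents API, function names are mine):

```python
import math

NEG_INF = -1e9  # large negative logit ~ zero probability after softmax


def mask_logits(logits, valid_mask):
    """Replace logits of invalid actions with a large negative value."""
    return [l if m else NEG_INF for l, m in zip(logits, valid_mask)]


def masked_probs(logits, valid_mask):
    """Softmax over masked logits; invalid actions get ~0 probability."""
    masked = mask_logits(logits, valid_mask)
    mx = max(masked)  # subtract the max for numerical stability
    exps = [math.exp(l - mx) for l in masked]
    z = sum(exps)
    return [e / z for e in exps]
```

In a real agent the mask would come from the observation and be applied inside the policy's distribution head, both at sampling time and when computing log-probabilities for the PPO loss.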
-
I'm trying to implement a PPO agent to play with LunarLander-v2 with tf_agents library like it was in [this tutorial](https://pylessons.com/LunarLander-v2-PPO/) ([_github repo_](https://github.com/pyt…
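A core piece of any such PPO agent is the advantage estimation step (GAE), which can be sketched in plain Python; the `gamma`/`lam` values below are the usual defaults, not taken from that tutorial's repo:

```python
def gae_advantages(rewards, values, dones, last_value, gamma=0.99, lam=0.95):
    """Generalized Advantage Estimation over one rollout.
    values[t] is V(s_t); last_value bootstraps the state after the rollout."""
    advantages = [0.0] * len(rewards)
    gae = 0.0
    next_value = last_value
    for t in reversed(range(len(rewards))):
        nonterminal = 0.0 if dones[t] else 1.0
        # TD residual: r_t + gamma * V(s_{t+1}) - V(s_t)
        delta = rewards[t] + gamma * next_value * nonterminal - values[t]
        # exponentially weighted sum of residuals
        gae = delta + gamma * lam * nonterminal * gae
        advantages[t] = gae
        next_value = values[t]
    return advantages
```

The value-function targets are then `advantages[t] + values[t]`, and the advantages are typically normalized per batch before the PPO loss.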
-
Hello Morvan (莫烦), while running the simple_ppo algorithm I select an action from the current state with a = self.sess.run(self.sample_op, {self.tfs: s})[0], but the selected action comes out as nan. How should I modify the code so that nan values no longer appear while it runs?
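nan actions from a Gaussian PPO policy usually mean the network is emitting a nan mean or a degenerate sigma, often after the loss explodes or the log-std grows unbounded. A defensive sketch of the sampling step, independent of Morvan's code (function name and bounds are assumptions; the clamp range is a common convention):

```python
import math
import random

LOG_STD_MIN, LOG_STD_MAX = -20.0, 2.0  # common clamp range, an assumption


def safe_sample(mu, log_std, rng=random):
    """Sample a ~ N(mu, sigma) with a clamped log-std and nan checks."""
    if math.isnan(mu) or math.isnan(log_std):
        raise ValueError("policy network emitted nan; lower the learning "
                         "rate or normalize observations/advantages")
    log_std = min(max(log_std, LOG_STD_MIN), LOG_STD_MAX)
    sigma = math.exp(log_std)
    return mu + sigma * rng.gauss(0.0, 1.0)
```

Beyond clamping, the usual fixes are a smaller learning rate, observation and advantage normalization, and gradient clipping, so the network never produces nan in the first place.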