ppo Search Results - Githubissues

1000+ results
for ppo

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

mdivband/AV_IRL #1

"/lyceum/tg4u22/project/rollouts_1_hf_{suffix}.pkl" in airl_…

I don't know the format of input data, so I can't encode my own data into correct format, can you give a demo data of demonstrations in airl_train_loop.py. Thanks.

Arya2003lm updated 6 months ago
1
ray-project/ray #31783

[RLlib] AlgorithmConfig() defaults not used by build_sac_mod…

### What happened + What you expected to happen - [x] I searched for related issues and did not find anything matching. The closest issue(s) are: https://github.com/ray-project/ray/issues/22747 and…

gresavage updated 1 year ago
2
RobertTLange/gymnax-blines #2

Add A2C implementation

Reminder todo after internship. Mostly for meta-bandit and gridworld tasks

RobertTLange updated 2 years ago
2
ir413/mvp #17

72956 segmentation fault

Hi! Thanks for your great sharing! I met the `72956 segmentation fault` when I tried to train the task with `Pixels` suffix like `FrankaPickPixels`. Besides, I have finished the training success…

zichunxx updated 2 months ago
1
ray-project/ray #39519

[RLlib] Make `_check_if_diag_gaussian` available in the util…

### Description Right now the function is in `ppo_catalog.py` but will be used by many `RLModule` subclasses. Make the function available in a more central place like `rllib.utils`. ### Use case ``…

simonsays1980 updated 1 year ago
1
Stable-Baselines-Team/stable-baselines3-contrib #224

Implementing "Sibling Rivalry" Method from "Keeping Your Dis…

### 🚀 Feature I propose the implementation of the "Sibling Rivalry" method, as outlined in the paper "Keeping Your Distance: Solving Sparse Reward Tasks Using Self-Balancing Shaped Rewards." Link to …

vladyskai updated 9 months ago
1
linyiLYi/snake-ai #2

可以test，无法训练，报错

(SnakeAI) E:\snake-ai-master\main>python train_cnn.py Using cuda device Wrapping the env in a VecTransposeImage. Process SpawnProcess-5: Traceback (most recent call last): File "C:\Users\KEN202…

aijunzhao updated 1 year ago
18
gwsystems/composite #184

Static check: RETYPE KERN or USER to UNTYPED wouldn't work

Based on the code review, RETYPE to COSFRAME wouldn't work because it calls `pgtbl_get_cosframe()` which is an API to get a COSFRAME and would return `-EPERM` if this is not UNTYPED/COSFRAME memory. M…

phanikishoreg updated 6 years ago
2
jr-robotics/robo-gym #72

How can i start distributed parallel enviroment in the proce…

Hi there, the readme says that distributed parallel sampling can be implemented. But it doesn't look like this feature is presented in examples, for example the td3_script.py. In issure #24 , you s…

Daviddeer2 updated 1 year ago
1
ikostrikov/pytorch-a2c-ppo-acktr-gail #284

why PPO needs to store action_log_probs instead of using sto…

Hi, I am looking at the PPO implementation, and I am curious about this part (actually many other implementations are using this workflow as well, so I am also curious to see if I miss anything) …

Emerald01 updated 3 years ago
1

上一页 1...85 86 87 88 89 90 91...100 下一页

1000+ results for ppo

1000+ results
for ppo