-
I don't know the format of input data, so I can't encode my own data into correct format, can you give a demo data of demonstrations in airl_train_loop.py. Thanks.
-
### What happened + What you expected to happen
- [x] I searched for related issues and did not find anything matching. The closest issue(s) are: https://github.com/ray-project/ray/issues/22747 and…
-
Reminder todo after internship.
Mostly for meta-bandit and gridworld tasks
-
Hi! Thanks for your great sharing!
I met the `72956 segmentation fault` when I tried to train the task with `Pixels` suffix like `FrankaPickPixels`.
Besides, I have finished the training success…
-
### Description
Right now the function is in `ppo_catalog.py` but will be used by many `RLModule` subclasses. Make the function available in a more central place like `rllib.utils`.
### Use case
``…
-
### 🚀 Feature
I propose the implementation of the "Sibling Rivalry" method, as outlined in the paper "Keeping Your Distance: Solving Sparse Reward Tasks Using Self-Balancing Shaped Rewards." Link to …
-
(SnakeAI) E:\snake-ai-master\main>python train_cnn.py
Using cuda device
Wrapping the env in a VecTransposeImage.
Process SpawnProcess-5:
Traceback (most recent call last):
File "C:\Users\KEN202…
-
Based on the code review, RETYPE to COSFRAME wouldn't work because it calls `pgtbl_get_cosframe()` which is an API to get a COSFRAME and would return `-EPERM` if this is not UNTYPED/COSFRAME memory. M…
-
Hi there, the readme says that distributed parallel sampling can be implemented. But it doesn't look like this feature is presented in examples, for example the td3_script.py.
In issure #24 , you s…
-
Hi,
I am looking at the PPO implementation, and I am curious about this part (actually many other implementations are using this workflow as well, so I am also curious to see if I miss anything)
…