-
I have a custom environment with a [MultiDiscrete](https://github.com/openai/gym/blob/master/gym/spaces/multi_discrete.py) action space. The MultiDiscrete action space allows controlling an agent with…
-
### Search before asking
- [ ] I searched the [issues](https://github.com/ray-project/ray/issues) and found no similar issues.
### Ray Component
RLlib
### Issue Severity
High: It blocks me to co…
-
### What happened + What you expected to happen
I am trying to run A3C algorithm. But I got this error.
2024-07-09 09:46:19.039023: W tensorflow/stream_executor/platform/default/dso_loader.cc:55] …
-
Thanks for sharing such a cool thing!
But I have an another issue in master branch:
```
AttributeError: 'FastAtariEnv' object has no attribute '_buffer'
2018-05-21 21:24:06 MainThread: Popul…
-
### Question
Hi, I am working on simple_adversary_v3 along with agileRL to train the agents. I am using parallel_env and want to monitor the agents positions while training. I found that step functio…
-
**Is your feature request related to a problem? Please describe.**
Hi there, Burr is a popular repository to build agents. But most copilots aren't able to help with the latest syntax. So we created …
-
I am trying to run a QRNN Network with Neural LinUCB Agent.
My parameters are as follows:
```
AGENT_ALPHA = 1
GAMMA = 1
EPS_GREEDY = 0
LAYERS = (512,256,128)
LR = 0.005
NUM_TRAINING_STEPS = 10…
-
## Describe the bug
Despite applying the appropriate guards (`mp.set_start_method('spawn')`, `if __name__ == "__main__"`), using `MultiSyncDataCollector`
with the `cuda` device causes program to f…
-
### Search before asking
- [x] I searched the [issues](https://github.com/ray-project/ray/issues) and found no similar issues.
### Ray Component
Ray Core, Ray Tune
### What happened + What you ex…
-
As the definition goes, CTDE means, "The learning algorithm has access to all local action-observation histories and global state s, but each agent’s learnt policy can condition only on its own action…