-
## Bug description
RewardNet `predict_processed` method only works using `state, action, next_state and done` attributes, despite trained using only `state, action`.
For example, the [BasicRewardN…
-
### What happened + What you expected to happen
When using a dict action space with different Spaces inside the ordering of the OBS in DictFlatteningPreprocessor is taken into account.
So at the m…
-
Hello author, thank you for your excellent work.
I have some questions about the inference stage. In some of my previous experiences, such as Deepmimic and AMP methods, a separate model is trained …
-
### Question
I tried hard to train an agent to solve any of the AntMaze environments. I tried the stable baselines 3 implementations of SAC (dense and sparse) and PPO, but could not solve even a sm…
meppe updated
1 month ago
-
### 🐛 Bug
I am implementing a simple custom environment for using PPO with MultiDiscrete observation space.
It works if I use MultiDiscrete([ 5, 2, 2 ]), but when it becomes a multidimensional array…
-
### What happened + What you expected to happen
While running the action masking example for the new API found [here](https://github.com/ray-project/ray/blob/master/rllib/examples/rl_modules/action…
-
There is an issue with the seeds after the Seeding update (#2422).
```
File "/home/joe/reinforce/rlenv/lib/python3.8/site-packages/gym/wrappers/time_limit.py", line 29, in reset
return self.e…
-
In [`619b761`](https://github.com/JonasSchaber/GymAll/commit/619b7612e27a69b9f2f95954450ce8acf0f3b292
), Homepage (gymnasium-allermoehe.hamburg.de) was **down**:
- HTTP code: 0
- Response time: 0 ms
-
In [`4460724`](https://github.com/JonasSchaber/uptime/commit/4460724b497e1294a9477cf61551fb4bd334649a
), Homepage (gymnasium-allermoehe.hamburg.de) was **down**:
- HTTP code: 0
- Response time: 0 ms
-
## install finrl library
!pip install git+https://github.com/AI4Finance-LLC/FinRL-Library.git
Collecting git+https://github.com/AI4Finance-LLC/FinRL-Library.git
Cloning https://github.com/AI4Fi…