-
Hi and thank you very much for your work,
I would like to use the MuJoCo implementation of Hopper, which has obs_dim=11 and action_dim=3. However, when making the environment with gym.make('HopperM…
-
Post your questions here about: [“Language Learning with Large Language Models”](https://docs.google.com/document/d/1vCRoU_g9yYwG31uZMdAVK8iNL5Jj8BB4iwcvarTq06E/edit?usp=sharing) and “Digital Doubles …
-
Pose a question about one of the following articles:
[“Generative agents: Interactive simulacra of human behavior.”](https://dl.acm.org/doi/abs/10.1145/3586183.3606763) Park, Joon Sung, Joseph O'B…
-
First of all, thanks for the amazing implementation, really helpful for understanding the DQN.
I'm curious about the results on the space invaders, it shows the avg 2772, while PER original paper sho…
-
-
### ❓ Question
It seems that this system does not support MARL ?
### Checklist
- [X] I have checked that there is no similar [issue](https://github.com/DLR-RM/stable-baselines3/issues) in the repo
…
-
I don't see any reference to mpiexec when searching in the repo. It it intended that we run with mpiexec to get a parallel version of DDPG?
eg I've tried this:
`mpiexec -n 4 python -m baselines.d…
-
there is interesting literature from psy/cognitive science how system 2 might work. it's not describing thorough cognitive architectures, but is relevant nonetheless. I'll grab some and drop them here…
-
### Summary
1. どんなもの?(Abstract,Conclusion)
エージェントがキーボードやマウス操作によってインターネット上のタスクを実行する強化学習環境, World of Bits (WoB)を開発した。WoBの主な課題は2つある。(i) ウェブベースのタスクを要約、整理すること、(ii) 報酬構造があり、ウェブの移り変わりにもかかわらず再現可能であること。HTTPト…
-
Post your questions here about: “[Training and Taming Deep Networks](https://docs.google.com/document/d/1gne-oWcJs1p5sEjUumapq6HKeaOet3EHxJ-Ij0LuTro/edit?usp=sharing)” & “[The Expanding Universe of De…