-
https://arxiv.org/abs/2203.14708
-
It's been cited by many users as the reason for switching to PyTorch, but I've yet to find a justification / explanation for sacrificing the most important practical quality, speed, for eager executio…
-
Dear all, I am new to reinforcement learning, but I am fascinated by WarpDrive. I was wondering if you could help me build a custom env for my small study project. The story of my env i…
-
Anki 2.1.54 (b6a7760c) Python 3.9.7 Qt 5.15.2 PyQt 5.15.5
Platform: Windows 10
Flags: frz=True ao=True sv=2
Add-ons, last update check: 2022-09-27 20:58:33
…
-
The first approach
`score = max(self._eval_num_done - self._eval_num_limit - self._eval_num_broken, 0) * mean(self._eval_max_reward) * self._eval_factor / self._eval_num_cycles`
is a little …
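
For reference, the first formula quoted above can be written as a standalone function. This is only a sketch: the parameter names mirror the `self._eval_*` attributes in the expression, and the sample values in the usage line are made up for illustration, not taken from the project.

```python
from statistics import mean


def eval_score(num_done, num_limit, num_broken, max_rewards, factor, num_cycles):
    """Standalone version of the quoted score expression.

    Parameter names are assumptions that mirror the self._eval_* attributes;
    what each counter means is defined by the original project, not here.
    """
    # Clamp at zero so the score never goes negative.
    net_done = max(num_done - num_limit - num_broken, 0)
    # Scale by the mean of the recorded max rewards, then normalize by cycles.
    return net_done * mean(max_rewards) * factor / num_cycles


# Hypothetical values: (10 - 2 - 3) * mean([1.0, 3.0]) * 0.5 / 5
print(eval_score(10, 2, 3, [1.0, 3.0], 0.5, 5))
```

Because of the `max(..., 0)` clamp, any run where `num_done` does not exceed `num_limit + num_broken` scores exactly zero, regardless of the rewards collected.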
-
Hello! Your work on MTMFQ has solved the heterogeneous agents problem, and I have learned a lot from it. However, I do not quite understand the implementation of Boltzmann exploration in the code.
ht…
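
As background for the question, here is a generic sketch of Boltzmann (softmax) exploration over a list of Q-values. It is not the MTMFQ repository's implementation; the function names and the `temperature` parameter are assumptions chosen for illustration.

```python
import math
import random


def boltzmann_probs(q_values, temperature=1.0):
    """Return softmax action probabilities for the given Q-values.

    Generic textbook form: P(a) is proportional to exp(Q(a) / temperature).
    """
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    scaled = [q / temperature for q in q_values]
    # Subtract the maximum before exponentiating for numerical stability.
    top = max(scaled)
    exps = [math.exp(s - top) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def sample_action(q_values, temperature=1.0, rng=random):
    """Sample an action index according to the Boltzmann distribution."""
    probs = boltzmann_probs(q_values, temperature)
    return rng.choices(range(len(q_values)), weights=probs, k=1)[0]


# Hypothetical Q-values for three actions.
action = sample_action([0.1, 0.5, 0.2], temperature=0.5)
```

A high temperature pushes the distribution toward uniform (more exploration), while a temperature close to zero concentrates probability on the action with the largest Q-value.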
-
Hello, I am studying your open-source code, but when I follow the instructions in the reinforcement learning "training agent" section of the README, I run into the following problem.…
-
I have enough resources, but a warning is still reported:
```
The actor or task with ID 124a2b0fc855a8f8ffffffff01000000 cannot be scheduled right now. It requires {CPU: 1.000000} for placement, but thi…
-
**Describe the bug**
I released version 0.1.3 of https://pypi.org/project/entity-gym-rs/ yesterday.
However, https://pypi.org/pypi/entity-gym-rs/json shows 0.1.1 as the latest version:
…
-
### Bug, feature, reasonable design choice - underline your opinion
The RL algorithms implemented in `stable-baselines3` do not support unbounded action spaces, i.e.
```
gym.spaces.Box(shape=s…