-
### 🐛 Bug
The documentation of DQN agent (https://stable-baselines3.readthedocs.io/en/master/modules/dqn.html) specifies that log_interval parameter is "The number of timesteps before logging". How…
-
Hi,
I am running your implementation of DQN.
Tried several runs, but it does not seem to converge with the given hyperparameters.
Tried higher amount of episodes in the magnitude of 1000, but the…
-
### What happened + What you expected to happen
Recent changes in the Replay Buffer APIs have made it so Apex DQN crashes while trying to add a sample to its replaybuffer.
The reproduction scrip…
-
凡事翻译过贡献者,都可以申请得到官方的纪念礼物:
如果有问题的人私聊: QQ 529815144
```
邮件部分内容:
As a start, let us know where we can ship you some PyTorch swag!
We have stickers, t-shirts, hoodies, and backpacks – let us know w…
-
I am trying to run the colab notebook attached to the reinforcement q learning example.
I keep getting following error:
pytorch.org -> Tutorials -> Reinforcement Learning -> Reinforcement Learning…
-
Hi Edouard,
Thank you for your amazing contribution at the first place.
I am currently studying DQN network(image-input with convolutional network) and want to implement it to highway-env.
I have…
-
## DI-engine
- 项目地址:https://github.com/opendilab/DI-engine
- 类别:Python、机器学习
- 项目标题:DI-engine 是一个基于 PyTorch 和 JAX 的通用决策智能引擎。
- 项目描述:
**DI-engine** 以 **python-first** 和 **asynchronous-nati…
-
The blog repo is now ca. 300MB heavy - should we maybe move the images etc... out into a dataset?
What do you think @osanseviero ?
```
25.0 MiB [##########] /27_summer_at_huggingface
24.9…
-
### 🐛 Bug
I'm trying to import [HParam as per documentation](https://stable-baselines3.readthedocs.io/en/master/guide/tensorboard.html#logging-hyperparameters) in order to create a callback class. Bu…
-
**Describe the bug**
An environment with a Dict Action Space, but a non-Dict non-Tuple Observation Space, will fail the `_check_nan(env)` [check](https://github.com/DLR-RM/stable-baselines3/blob/97b8…