-
微博内容精选
-
# 🐛 Describe the bug
Hello, I download pytorch2.0 and play with the toy example
```python
import torch
import torchvision.models as models
import faulthandler
faulthandler.enable()
model …
-
I am walking through the implementation and it seems that the `MultiSignal.step` enforces a peculiarity where if the actor resolution is < than the yellow time of signals, then all actors are unable t…
-
Hi, All. Nice work! I would like to ask if your sumo-rl library can implement multi-agent vehicle control? For example, I am trying to make a scene: 1km straight single-lane for car following . There …
-
Hi, @ScheiklP
Sorry disturbing you again. Some time ago the question of "successful_task" was resolved. I drew the following line diagram with wandb.
Set "number_of_envs" to 8 and the results were …
-
Hi all,
I have a couple of questions regarding the chosen hyperparameters (i.e., network architecture, PPO hyperparameters, etc)
How did you decide on these specific values? (did you run a hyperpa…
-
Thank you so much for your work, When I run `pythonrun_rl_training.py --exp-name rl_test1 --num-gpus 3 --dataset_train /workspace/datasets/generated_1385_training/1385_training/ --dataset_test / work…
-
My research interest is robustness of RL algorithms to environment parameters. I want to modify currents RL algorithms to make them achieve good performance when they are tested in environments with u…
-
Hi guys!
I'm using Vizdoom for my bachelor thesis experiments and trained the agents in DeadlyCorridor.
I implemented a probability calculator for choosing the action "Attack" which is ridiculousl…
-
Hi, thanks for the great work.
I also have a question about KL divergence loss.
In papers like [Learning to summarize from human feedback](https://arxiv.org/pdf/2009.01325.pdf), the KL item for huma…