-
- Revise Fed-batch to the simple version from the paper
- Check the Polymerization parameters (the paper's parameters differ from the do-mpc version's)
- Check the DQN and QRDQN code
-
Hello, where can I find the training environment UAVenv?
-
Hello
I am very grateful to you for sharing your models. They are very clear and detailed.
But I see that there are many files in the folders. Could you show me how to run each scenario?
Thank you …
-
Hi,
I would like to ask whether there is a JAX-based version of the code,
and whether you have any recommendations for JAX-based offline RL algorithms.
Thanks!
-
Hi, it seems the code only modifies one edge per graph. Is there a simple way to change the code so that it modifies multiple edges? I have not found a parameter in the code to control the edge budget. T…
-
Excuse me. I installed it on "Ubuntu 14.04.3 LTS". When I run "./run_cpu breakout", it shows the following error. Could you please help me with this? Thanks a lot in advance!
# ./run_cpu breakout
-fram…
-
Hi! I am really interested in this project, as the VIPER algorithm is relevant to my own research (which is also within safe and explainable RL). Therefore I would like to know if you are interested …
-
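I downloaded the DQN code and found that it fails at runtime. The errors are mainly in two places:
1. In choose_action(self, observation):
observation = observation[np.newaxis, :] raises TypeError: tuple indices must be integers or slices, not tuple
2. After fixing the first error (via the course discussion forum…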
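
The `TypeError` in item 1 means `observation` is a Python tuple rather than a NumPy array when it reaches `choose_action`. One possible cause, assumed here because the issue does not state the library version, is that newer Gym/Gymnasium releases return an `(observation, info)` pair from `env.reset()`. A minimal sketch of a guard follows; the helper name `to_batched_observation` and the 4-dimensional state are hypothetical and not part of the original DQN code.

```python
import numpy as np


def to_batched_observation(observation):
    """Return the observation as an array with a leading batch axis.

    Assumption: a 2-tuple whose second item is a dict is an
    (obs, info) pair, and only the first item is the state.
    """
    if (
        isinstance(observation, tuple)
        and len(observation) == 2
        and isinstance(observation[1], dict)
    ):
        observation = observation[0]
    # Convert lists or arrays to float32 so indexing with np.newaxis is valid.
    observation = np.asarray(observation, dtype=np.float32)
    return observation[np.newaxis, :]


# Hypothetical 4-dimensional state, wrapped the way a reset() call might return it.
raw = (np.array([0.01, -0.02, 0.03, 0.04]), {})
batched = to_batched_observation(raw)
print(batched.shape)
```

If this is the cause, unpacking at the call site (`observation, info = env.reset()`) would be an equivalent fix.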
-
I'm trying to run the test job `./bin/dqn -save state/test -alsologtostderr --nogpu` on the latest version of HFO without a GPU. Even after 2000 iterations I'm not seeing any improvement in episode re…
-
Have you managed to create a pong project yet?