-
In Pytorch >= 1.4, grad_norm is a torch tensor (changed in https://github.com/pytorch/pytorch/pull/32020) and not a float, so the logger throws an exception here (`values` is now a list of pytorch te…
-
# Deep Q-Network (DQN) on LunarLander-v2 | Chan`s Jupyter
In this post, We will take a hands-on-lab of Simple Deep Q-Network (DQN) on openAI LunarLander-v2 environment. This is the coding exercise fr…
-
Traceback (most recent call last):
File "main.py", line 120, in
main()
File "main.py", line 117, in main
atari_learn(env, task.env_id, num_timesteps=task.max_timesteps, double_dqn=dou…
-
RuntimeError: Expected object of backend CPU but got backend CUDA for argument #4 'mat1'
File "/home/matt/Dropbox/Receiver/Antenna/antenna_sims/pytorch/DRL/Agents/Base_Agent.py", line 128, in run…
-
could you share the dependency for this repo?
-
Hi,
while working on a PyTorch DQN agent for BSuite experiments, I noticed quite bad results on the mnist and mountain car experiments. I see that a similar question was addressed [here](https://gi…
-
### Proposal
To encourage the use of Gymnasium and build up the RL community, I would propose that a large range of tutorials are created.
This is a list of tutorials that could be made
- [x…
-
**Logical attack error:**
AutoPentest-DRL: Compute attack path for logical network...
Generate attack graph using MulVAL...
Process attack graph into attack matrix...
Traceback (most recent call…
-
Hi,大佬
@Crawford-fang
非常感谢大佬可以公开自己的开源代码,我对于深度强化学习训练机器人非常感兴趣,我下载了您的代码,但是运行出现错误,如下:
`/home/he/miniconda3/envs/rostorch/bin/python3.7 /home/he/turtlebot_ws/src/ROS_pytorch_RL/DQN/DQN2.py
Traceback (m…
-
## 🚀 Feature
There seems to be fair few inefficiencies in the RL model code.
In both the VPG and DQN code, the network is computed twice, once to generate the trajectory and then once again in the…