-
Hi Max,
this is the Dueling DQN implementation from DeepMind: https://arxiv.org/pdf/1511.06581.pdf
![image](https://user-images.githubusercontent.com/23219722/58774004-7d43f780-858d-11e9-8dc8-7b…
-
# The problem
I am trying out the [MNIST for experts tutorial](https://www.tensorflow.org/versions/r0.7/tutorials/mnist/pros/index.html#deep-mnist-for-experts) and I have inconsistent results on the …
-
It seems only the positive bounding boxes might be loaded, is this the case?
![bounding box](https://user-images.githubusercontent.com/34765938/35213959-75afc960-ff9a-11e7-9916-ac11117415c5.png)
A…
-
## 🐛 Bug
The mere execution of `torch.distributions.categorical.Categorical.sample()` without even using its resulting tensor in the loss function seems to have an effect on the gradient calculatio…
-
Traceback (most recent call last):
File "d:/Python/WS/PyTorch/Deep-Reinforcement-Learning-Hands-On/Chapter10/02_pong_a2c.py", line 159, in
tb_tracker.track("advantage", adv_v, step_idx)…
-
I was going through the reinforce example and noticed:
https://github.com/pytorch/examples/blob/ea825a5aa6c2db3743c803821a5e220301ebf5b4/reinforcement_learning/reinforce.py#L91
```
running_rewa…
-
Here is the v0.3 release plan. **The tentative release date is 06/07.**
[Feature] Kernel support
-------------------------
Kernels are critical for our system performance. The next release will i…
-
Hello, does the lib support multi-agent environment?
Or more precisely, allow multiple agents share environment state, select their action in parallel, then return the combined actions to the environ…
-
Thank you for these useful examples. I am trying to implement D4PG in multiple agents that interact with each other, share the same reward, but each agent takes its own actions. I wonder if you had an…
-
E2E tests: https://github.com/pytorch/pytorch/pull/8451
Working models:
- [x] DCGAN example
- [x] Fast Neural Style Transfer (InstanceNormalization bug #8439)
- [x] MNIST (partially working, #…