-
Dear Developers,
I'm getting the following error when running the code below
> pearl/neural_networks/common/value_networks.py", line 262, in get_q_values
x = torch.cat([state_batch, action_batc…
-
why multiply by action and use reduce sum instead of argmax?
-
When creating`model_2` and trying to load the weights by
```python
model_2.load_weights(checkpoint_path)
```
I'm getting the following error:
```
-----------------------------------------…
-
Hi
Thank you so much for your contribution. This is a really great repo for students.
I think it will be very nice if we can try the atari offline training with some recently proposed methods.
…
-
Hi may I know which gpus are used, along with numbers, for training in following papers?
1. Bao, Wentao, Qi Yu, and Yu Kong. "Uncertainty-based traffic accident anticipation with spatio-temporal re…
-
- [ ] #4
- [ ] #12
- [x] #5
- [ ] #6
- [ ] #7
-
Hi, I had some trouble when I was planning to simulate _Q-Learning Algorithm for VoLTE Closed Loop Power Control in Indoor Small Cells_. In order to switch to tabular environment, I did the following…
-
In the paper replicating code when I write the following:
```
# For this notebook to run with updated APIs, we need torch 1.12+ and torchvision 0.13+
try:
import torch
import torchvision…
-
For reference, we will collect a list of discussed papers as well as the date of discussion in this issue.
leezu updated
7 years ago
-
I want to make a project using reinforcement learning in which a bot send scam to other bots on social media, other bots detect the scam and reject it.
I think it needs a deep reinforcement learning…