pytorch-reinforcement-learning Search Results

724 results
for pytorch-reinforcement-learning

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

PacktPublishing/Deep-Reinforcement-Learning-Hands-On #52

Dueling DQN implementation possibly wrong?

Hi Max, this is the Dueling DQN implementation from DeepMind: https://arxiv.org/pdf/1511.06581.pdf ![image](https://user-images.githubusercontent.com/23219722/58774004-7d43f780-858d-11e9-8dc8-7b…

NikEyX updated 5 years ago
8
tensorflow/tensorflow #2732

Mention that GPU reductions are nondeterministic in docs

# The problem I am trying out the [MNIST for experts tutorial](https://www.tensorflow.org/versions/r0.7/tutorials/mnist/pros/index.html#deep-mnist-for-experts) and I have inconsistent results on the …

shiviser updated 5 years ago
52
tnikolla/robot-grasp-detection #14

Multiple bboxes, positive and negative bboxes?

It seems only the positive bounding boxes might be loaded, is this the case? ![bounding box](https://user-images.githubusercontent.com/34765938/35213959-75afc960-ff9a-11e7-9916-ac11117415c5.png) A…

ahundt updated 5 years ago
8
pytorch/pytorch #23817

`probs.sample()` affects reproducibility of the gradient.

## 🐛 Bug The mere execution of `torch.distributions.categorical.Categorical.sample()` without even using its resulting tensor in the loss function seems to have an effect on the gradient calculatio…

vwxyzjn updated 5 years ago
2
PacktPublishing/Deep-Reinforcement-Learning-Hands-On #23

02_pong_a2c.py not working with argument --cuda

Traceback (most recent call last): File "d:/Python/WS/PyTorch/Deep-Reinforcement-Learning-Hands-On/Chapter10/02_pong_a2c.py", line 159, in tb_tracker.track("advantage", adv_v, step_idx)…

iimmer updated 5 years ago
3
pytorch/examples #512

mistake on reinforce examples

I was going through the reinforce example and noticed: https://github.com/pytorch/examples/blob/ea825a5aa6c2db3743c803821a5e220301ebf5b4/reinforcement_learning/reinforce.py#L91 ``` running_rewa…

brando90 updated 5 years ago
1
dmlc/dgl #450

[Roadmap] v0.3 release checklist

Here is the v0.3 release plan. **The tentative release date is 06/07.** [Feature] Kernel support ------------------------- Kernels are critical for our system performance. The next release will i…

jermainewang updated 5 years ago
27
astooke/rlpyt #14

Multi-Agent Env Support?

Hello, does the lib support multi-agent environment? Or more precisely, allow multiple agents share environment state, select their action in parallel, then return the combined actions to the environ…

wangwwno1 updated 4 years ago
30
PacktPublishing/Deep-Reinforcement-Learning-Hands-On #41

Multi-agent D4PG

Thank you for these useful examples. I am trying to implement D4PG in multiple agents that interact with each other, share the same reward, but each agent takes its own actions. I wonder if you had an…

canteli updated 5 years ago
2
pytorch/pytorch #8452

[JIT][meta] Hybrid front-end example-level robustness testin…

E2E tests: https://github.com/pytorch/pytorch/pull/8451 Working models: - [x] DCGAN example - [x] Fast Neural Style Transfer (InstanceNormalization bug #8439) - [x] MNIST (partially working, #…

jamesr66a updated 5 years ago
1

上一页 1...65 66 67 68 69 70 71...73 下一页

724 results for pytorch-reinforcement-learning

724 results
for pytorch-reinforcement-learning