-
Why does FairseqDecoder's get_normalized_probs use output[0] as the input to the softmax function? (It results in a dimension mismatch in my loss calculation.)
I was trying to implement my own model (LSTM based, …
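For context, the decoder's forward in fairseq conventionally returns a tuple `(logits, extra)`, which is why the method indexes `net_output[0]`. Below is a hedged, simplified sketch of what the base implementation does in spirit (the real method also handles adaptive softmax and ONNX tracing); it is illustrative, not the library source:

```python
# Hedged sketch of FairseqDecoder.get_normalized_probs, simplified.
# net_output is the tuple returned by the decoder's forward: (logits, extra),
# so net_output[0] holds raw logits of shape (batch, tgt_len, vocab_size).
import torch.nn.functional as F

def get_normalized_probs(net_output, log_probs):
    logits = net_output[0].float()
    if log_probs:
        return F.log_softmax(logits, dim=-1)
    return F.softmax(logits, dim=-1)
```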
-
I am new to Ape-X DQN. Thanks for your contribution; your code helped me understand this algorithm. Please also add the code for `gather_experience` from the Actor class.
```python
…
```
-
Hello, thank you so much for sharing this code structure! There is one thing I'm not quite sure about in your code.
https://github.com/Shivanshu-Gupta/Pytorch-Double-DQN/blob/1cff44d95d7881c6afc029b734508b1…
-
- [ ] I have marked all applicable categories:
+ [ ] exception-raising bug
+ [x] RL algorithm bug
+ [x] documentation request (i.e. "X is missing from the documentation.")
+ [ ] ne…
-
### Describe your feature request
Hello,
In its current form, action selection in PyTorch uses either `compute_actions_from_dict` or `compute_actions` (the latter creates an input dict). Both…
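For illustration, a hedged sketch of the two call paths described above. The method names are taken from the request and may differ across RLlib versions; `policy` is assumed to be an already-built RLlib torch Policy and `obs_batch` a NumPy batch of observations:

```python
# Hedged sketch only: method names follow the request text and may not match
# every RLlib version; `policy` is an assumed, already-constructed Policy.
import numpy as np

def act_both_ways(policy, obs_batch: np.ndarray):
    # Path 1: pass raw observations; the policy builds its input dict internally.
    out_a = policy.compute_actions(obs_batch)
    # Path 2: build and pass the input dict yourself.
    out_b = policy.compute_actions_from_dict({"obs": obs_batch})
    return out_a, out_b
```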
-
This is rather minor, but polyak averaging in DQN/SAC/TD3 could be done faster with far fewer intermediate tensors using `torch.addcmul_` https://pytorch.org/docs/stable/torch.html#torch.addcmul.
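For reference, a minimal sketch of an in-place polyak update that avoids the intermediate tensors a naive `tau * src + (1 - tau) * tgt` copy creates; it uses `mul_`/`add_` rather than `torch.addcmul_`, and the names `target_net`, `source_net`, and `tau` are illustrative:

```python
# Hedged sketch: in-place polyak (soft) update of target-network parameters,
#   target <- (1 - tau) * target + tau * source
# written without temporary tensors.
import torch

@torch.no_grad()
def polyak_update(target_net, source_net, tau=0.005):
    for tgt, src in zip(target_net.parameters(), source_net.parameters()):
        tgt.mul_(1.0 - tau).add_(src, alpha=tau)
```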
-
When using RLlib with evaluation turned on, RLlib tries to convert a PyTorch CUDA tensor to NumPy and fails with the exception "TypeError: can't convert cuda:0 device type tensor to numpy. Use Tensor.cpu() …
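Outside of RLlib, the error itself is easy to reproduce and to fix in the way the message suggests; a minimal, RLlib-independent illustration:

```python
# Minimal illustration of the error and the fix the message suggests:
# move the tensor to host memory before calling .numpy().
import torch

if torch.cuda.is_available():
    t = torch.zeros(3, device="cuda")
    # t.numpy() raises: TypeError: can't convert cuda:0 device type tensor to numpy.
    arr = t.cpu().numpy()  # works: copy to CPU first, then convert
```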
-
I followed the instructions from here:
https://github.com/facebookresearch/Horizon/blob/master/docs/installation.md
to run the Docker image on Mac. However, when I run the example, I get the following …
-
I am trying to implement a deep Q-network using PyTorch. For sampling actions I am using the code shown in the snippet:
```python
def select_action(self, state):
    if random.uniform(0, 1) …
```
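For completeness, a generic ε-greedy selector of the kind the snippet starts to define; `self.epsilon`, `self.policy_net`, and `self.n_actions` are assumed, illustrative attributes rather than names taken from the question:

```python
# Hedged sketch of epsilon-greedy action selection for a DQN agent.
# self.epsilon, self.policy_net, and self.n_actions are assumed attributes;
# `state` is assumed to be a torch tensor of a single observation.
import random
import torch

def select_action(self, state):
    # Explore with probability epsilon, otherwise act greedily on Q-values.
    if random.uniform(0, 1) < self.epsilon:
        return random.randrange(self.n_actions)
    with torch.no_grad():
        q_values = self.policy_net(state.unsqueeze(0))  # shape: (1, n_actions)
        return int(q_values.argmax(dim=1).item())
```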
-
## 🚀 Feature
Implement dataloading functionality for reinforcement learning (state, action) pairs, with assigned policy scores, transition probabilities, and rewards.
Implement a set of gradient al…
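As a rough illustration of the first part of the request, a hedged sketch of a map-style PyTorch `Dataset` over such tuples; the field names, dtypes, and shapes are assumptions for illustration, not an existing API:

```python
# Hedged sketch: a map-style Dataset over RL transitions with policy scores,
# transition probabilities, and rewards. All names/shapes are illustrative.
import torch
from torch.utils.data import Dataset, DataLoader

class RLTransitionDataset(Dataset):
    def __init__(self, states, actions, policy_scores, transition_probs, rewards):
        self.states = torch.as_tensor(states, dtype=torch.float32)
        self.actions = torch.as_tensor(actions, dtype=torch.long)
        self.policy_scores = torch.as_tensor(policy_scores, dtype=torch.float32)
        self.transition_probs = torch.as_tensor(transition_probs, dtype=torch.float32)
        self.rewards = torch.as_tensor(rewards, dtype=torch.float32)

    def __len__(self):
        return len(self.states)

    def __getitem__(self, idx):
        return {
            "state": self.states[idx],
            "action": self.actions[idx],
            "policy_score": self.policy_scores[idx],
            "transition_prob": self.transition_probs[idx],
            "reward": self.rewards[idx],
        }

if __name__ == "__main__":
    # Example usage with synthetic data.
    n = 16
    ds = RLTransitionDataset(
        states=torch.randn(n, 4),
        actions=torch.randint(0, 2, (n,)),
        policy_scores=torch.rand(n),
        transition_probs=torch.rand(n),
        rewards=torch.randn(n),
    )
    loader = DataLoader(ds, batch_size=8, shuffle=True)
    batch = next(iter(loader))
```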