-
Hi,
For the last couple of days I have been working on scripts for training and evaluating RL agents for multiplayer dominoes, but I have run into two main issues:
1. **Scale**: Any training seems …
-
Hi Denny,
Thanks for this wonderful resource. It's been hugely helpful. Can you say what your results are when training the DQN solution? I've been unable to reproduce the results of the DeepMind p…
-
## 🐛 Bug
The memory usage comparison between the same data structures implemented with different backends (PyTorch tensors and NumPy arrays) shows over 4x higher usage when using PyTorch. Data stru…
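A common pitfall when comparing memory across backends is counting only raw buffer bytes. A minimal NumPy-only sketch (my own illustration, standing in for either backend; the PyTorch analogue of `nbytes` would be `tensor.element_size() * tensor.nelement()`) of how per-object overhead can inflate measured usage, especially for many small objects:

```python
import sys
import numpy as np

# `nbytes` counts only the raw data buffer, while the full Python object
# (header, shape/strides metadata) is larger. For data structures built
# from many small arrays or tensors, that per-object overhead -- not the
# element storage itself -- can dominate the measured memory footprint.
small = np.zeros(4, dtype=np.float32)   # 4 elements * 4 bytes = 16 bytes of data
buffer_bytes = small.nbytes             # 16
object_bytes = sys.getsizeof(small)     # noticeably larger than 16
```

Measuring at the process level (e.g. resident set size) instead of summing `nbytes` makes such overhead visible, which may account for part of a multi-x gap between backends.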
-
## 🚀 Feature
The LSTM layer in torch.nn should have the option to output the cell states of all time steps, just as it already outputs the hidden states of all time steps.
## Motivation
When implementing Re…
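Until such an option exists, one workaround is to unroll the recurrence manually and keep the cell state at every step. A minimal sketch in plain NumPy (standard LSTM equations; the weight shapes and gate ordering here are my assumptions for illustration, not torch.nn's internals — in PyTorch itself, looping with `nn.LSTMCell` achieves the same effect):

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def lstm_unroll(x, W, U, b, h0, c0):
    """Unroll an LSTM over time, keeping the cell state c_t at every step.

    x: (T, D) inputs; W: (4H, D); U: (4H, H); b: (4H,); h0, c0: (H,).
    Gates are packed in the order: input, forget, candidate, output.
    """
    T = x.shape[0]
    H = h0.shape[0]
    h, c = h0, c0
    hs, cs = [], []
    for t in range(T):
        z = W @ x[t] + U @ h + b          # all four gate pre-activations, shape (4H,)
        i = sigmoid(z[0:H])               # input gate
        f = sigmoid(z[H:2 * H])           # forget gate
        g = np.tanh(z[2 * H:3 * H])       # candidate cell update
        o = sigmoid(z[3 * H:4 * H])       # output gate
        c = f * c + i * g
        h = o * np.tanh(c)
        hs.append(h)
        cs.append(c)                      # <- the per-step cell states this feature asks for
    return np.stack(hs), np.stack(cs)     # each of shape (T, H)
```

The loop trades the speed of the fused cuDNN kernel for access to every intermediate `c_t`, which is exactly the trade-off the requested option would remove.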
-
Could those of you who are not using a double policy take a look: does your code converge? I have heard from others that applying tanh directly and then sampling from the distribution affects the entropy computation, but I don't know why. Could you explain?
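The question above concerns tanh squashing: if u is sampled from a Gaussian and the action is a = tanh(u), the density of a picks up a Jacobian (change-of-variables) term, so calling the base distribution's entropy ignores the squashing and gives a biased value. A minimal NumPy sketch (my own illustration, not this repo's code) comparing a Monte Carlo estimate of the squashed policy's entropy with the closed-form Gaussian entropy:

```python
import numpy as np

def squashed_logprob(u, mu, sigma, eps=1e-6):
    """log-density of a = tanh(u), where u ~ N(mu, sigma^2).

    The subtracted log(1 - tanh(u)^2) term is the change-of-variables
    (Jacobian) correction; dropping it is what biases naive entropy values.
    """
    base = -0.5 * ((u - mu) / sigma) ** 2 - np.log(sigma) - 0.5 * np.log(2 * np.pi)
    correction = np.log(1.0 - np.tanh(u) ** 2 + eps)   # eps guards against log(0)
    return base - correction

# Monte Carlo entropy of the squashed policy vs. the unsquashed Gaussian.
rng = np.random.default_rng(0)
mu, sigma = 0.0, 1.0
u = rng.normal(mu, sigma, size=100_000)
squashed_entropy = -squashed_logprob(u, mu, sigma).mean()
gauss_entropy = 0.5 * np.log(2 * np.pi * np.e * sigma ** 2)
```

The Monte Carlo estimate comes out below the Gaussian's closed-form entropy, since tanh compresses all probability mass into (-1, 1); this is why the entropy term must be computed from the corrected log-probability rather than from the base distribution.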
-
Hi,
I'm trying to run your code in DVRL mode (with the configuration you describe in the README file), and the results are significantly lower than the ones published in your paper (a…
-
I used the following two commands to identify broken links. `markdown-link-check` is available at https://github.com/tcort/markdown-link-check
``` bash
find ./Practical_RL/ -type f -name '*.ipynb' -exec jupyt…
-
https://datawhalechina.github.io/easy-rl/#/chapter7/chapter7
Description
-
Dear authors,
Thanks for the amazing work. Recently I followed the expert actions that I extracted from the `get_info()` function of the `AlfredThorEnv` class; however, the success rate is only slight…
-
### Search before asking
- [X] I searched the [issues](https://github.com/ray-project/ray/issues) and found no similar issues.
### Ray Component
RLlib
### Issue Severity
Medium: It contributes t…