-
Hello,
You advertise tianshou as being fast and provide a comparison table in the README.
However, no reference code is linked to reproduce those results.
So, I decided to create a Colab notebook…
-
I am a beginner in DRL. When I run this project on Linux, there are many problems due to incorrect versions of the installed packages. I use python=3.6.13, Textworld=1.0.0, pytorch=0.4, te…
-
## Question
Why does the zoo call the standard `make_vec_env()` for all environments, including Atari, when sb3 has a dedicated function for them, `make_atari_env()`?
## Train of thought
- train.py calls …
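To make the distinction in the question concrete, here is an illustrative pure-Python sketch (not sb3's actual code; all class and function names are simplified stand-ins): an Atari helper is essentially the generic vectorization helper with Atari preprocessing wrappers applied to each copy before training sees it.

```python
# Illustrative only: hypothetical stand-ins for the real sb3 helpers.

class Env:
    """Stand-in for a raw environment instance."""
    def __init__(self, env_id):
        self.env_id = env_id
        self.wrappers = []  # record applied wrappers so we can inspect them

class AtariWrapper:
    """Stand-in for Atari preprocessing (noop reset, frame skip, resize, ...)."""
    def __init__(self, env):
        env.wrappers.append("AtariWrapper")
        self.env = env

def make_vec_env(env_id, n_envs, wrapper=None):
    """Build n_envs copies of the env, optionally wrapping each one."""
    envs = []
    for _ in range(n_envs):
        env = Env(env_id)
        if wrapper is not None:
            wrapper(env)
        envs.append(env)
    return envs

def make_atari_env(env_id, n_envs):
    """Same as make_vec_env, but with Atari preprocessing baked in."""
    return make_vec_env(env_id, n_envs, wrapper=AtariWrapper)

plain = make_vec_env("PongNoFrameskip-v4", n_envs=2)
atari = make_atari_env("PongNoFrameskip-v4", n_envs=2)
print(plain[0].wrappers)  # []
print(atari[0].wrappers)  # ['AtariWrapper']
```

So calling the generic helper on an Atari id is not wrong per se, but it skips the standard preprocessing unless the wrappers are passed in some other way.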
-
- creating a Docker image (for backup)
- creating a Singularity image
-
## Purpose
The purpose of this issue (discussion) is to introduce a series of PRs in the near future targeted at releasing tianshou's benchmark for the MuJoCo Gym task suite using on-policy algorithms alrea…
-
Thanks for sharing! Since your network update differs from the paper authors', my question is as follows:
In this repo, does the line `for _ in range(args.max_steps)` mean that after executing `max_step=1000` steps, the network is then updated 1000 times?
```python
if j == args.max_steps - 1:
    up_st = time.time()
    …
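# For context, a minimal runnable sketch (hypothetical names, not this repo's
# actual code) of the schedule the question describes: collect max_steps
# environment steps, and only on the last iteration of the loop perform
# max_steps gradient updates.
max_steps = 1000
collected, updates = 0, 0
for j in range(max_steps):
    collected += 1              # one environment step per iteration
    if j == max_steps - 1:      # after the final collection step...
        for _ in range(max_steps):
            updates += 1        # ...run max_steps update iterations
# Both counters end at 1000: collection and updating are interleaved 1000/1000.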
-
I am using the PASCAL VOC dataset for training.
When I executed the command `python tools/train.py configs/pascal_voc/faster_rcnn_r50_fpn_1x_voc0712.py --gpus 1 --work_dir merge-output` to do the …
-
I am planning to experiment with population-based training and self-play, similar to DeepMind's recent Q3 CTF paper. The obvious requirement would be the ability to train the agents to play agains…
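As a rough sketch of what such self-play support might involve (all names here are hypothetical, not an existing API): keep a bounded pool of frozen snapshots of past policies and sample one as the opponent for each new episode.

```python
import random

class OpponentPool:
    """Minimal self-play helper (illustrative only): store snapshots of past
    policy parameters and sample one as the opponent for each episode."""

    def __init__(self, max_size=10):
        self.max_size = max_size
        self.snapshots = []

    def add(self, policy_params):
        """Store a frozen copy of the current policy's parameters."""
        self.snapshots.append(dict(policy_params))
        if len(self.snapshots) > self.max_size:
            self.snapshots.pop(0)  # drop the oldest snapshot

    def sample(self):
        """Pick a past policy uniformly at random as this episode's opponent."""
        return random.choice(self.snapshots)

pool = OpponentPool(max_size=3)
for step in range(5):
    pool.add({"version": step})   # snapshot after each training phase
opponent = pool.sample()
print(sorted(s["version"] for s in pool.snapshots))  # [2, 3, 4]
```

Population-based training layers on top of this: multiple such learners train in parallel, periodically copying hyperparameters and weights from better-performing peers.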
-
Hello, nice implementation of the DRL algorithms! I have been struggling quite a bit trying to understand how to incorporate an LSTM into the model and how to handle the hidden states, and your repo seemed …
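For what it's worth, the usual hidden-state bookkeeping (independent of any particular repo) is: carry the state across steps within an episode, and reset it whenever the environment signals done so no state leaks across episode boundaries. A toy sketch with a hand-rolled recurrent cell (hypothetical, no deep-learning library):

```python
import math

def rnn_step(x, h, w_x=0.5, w_h=0.9):
    """Toy recurrent cell: next hidden state from input x and previous state h."""
    return math.tanh(w_x * x + w_h * h)

def run_rollout(observations, dones):
    """Roll a recurrent policy over a batch of steps, resetting the hidden
    state at episode boundaries (where done is True)."""
    h = 0.0  # initial hidden state
    hidden_trace = []
    for x, done in zip(observations, dones):
        h = rnn_step(x, h)
        hidden_trace.append(h)
        if done:
            h = 0.0  # new episode: start from a fresh hidden state
    return hidden_trace

obs = [1.0, 1.0, 1.0, 1.0]
dones = [False, True, False, False]
trace = run_rollout(obs, dones)
# Step 2 starts a new episode, so its hidden state equals step 0's.
print(trace[2] == trace[0])  # True
```

With an actual LSTM the same pattern applies to the (h, c) pair, plus detaching the state between gradient updates so backpropagation stops at the rollout boundary.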
-
## Purpose
The purpose of this issue (discussion) is to introduce a series of PRs in the near future targeted at releasing a benchmark (SAC, TD3, DDPG) on MuJoCo environments. Some features of tianshou …