-
Hello,
In the A3C paper they state t_max = 5; is there any reason you set it to 32?
Actually, I don't really understand why the batch size should be so small. Why shouldn't we use traditional batch s…
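For reference, a rough sketch (using the classic Gym API, not this repo's actual code) of how t_max typically bounds the per-update rollout in an A3C-style worker; the environment, the random stand-in policy, and the constants are assumptions of mine:

```python
import gym

T_MAX = 5       # rollout length from the A3C paper; one gradient step uses at most this many samples
GAMMA = 0.99

env = gym.make("CartPole-v1")   # placeholder environment
obs = env.reset()

for update in range(10):
    # Each worker collects at most T_MAX transitions before it computes an update,
    # so the effective batch size of a single gradient step is only T_MAX samples.
    states, actions, rewards = [], [], []
    for _ in range(T_MAX):
        action = env.action_space.sample()        # stand-in for policy(obs)
        next_obs, reward, done, _ = env.step(action)
        states.append(obs)
        actions.append(action)
        rewards.append(reward)
        obs = next_obs
        if done:
            obs = env.reset()
            break

    # n-step returns over the short rollout (bootstrap value omitted for brevity).
    returns, R = [], 0.0
    for r in reversed(rewards):
        R = r + GAMMA * R
        returns.insert(0, R)

    # ...compute policy/value gradients from (states, actions, returns) and apply
    # them to the shared parameters; raising t_max (e.g. to 32) trades gradient
    # freshness for a larger, lower-variance batch per update.
```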
-
Hi, I'm trying to create a DqnAgent with a mask for valid/invalid actions. According to [this post][1], I should specify a ```splitter_fn``` for the ```observation_and_action_constraint_splitte…
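In case it helps, this is the minimal shape I'd expect the splitter to take in TF-Agents: it receives the full observation and returns the part the network should see plus the action mask. The dict keys ("observation", "valid_actions"), specs, and network below are my own assumptions, not the code from the linked post:

```python
import tensorflow as tf
from tf_agents.agents.dqn import dqn_agent
from tf_agents.networks import q_network
from tf_agents.specs import tensor_spec
from tf_agents.trajectories import time_step as ts

# Assumed observation layout: a dict holding the real observation under
# "observation" and a 0/1 validity mask under "valid_actions" (names are hypothetical).
def splitter_fn(observation):
    # Must return (observation_passed_to_the_network, action_mask).
    return observation["observation"], observation["valid_actions"]

num_actions = 4
observation_spec = {
    "observation": tf.TensorSpec([8], tf.float32),
    "valid_actions": tf.TensorSpec([num_actions], tf.int32),
}
action_spec = tensor_spec.BoundedTensorSpec([], tf.int32, minimum=0,
                                            maximum=num_actions - 1)
time_step_spec = ts.time_step_spec(observation_spec)

# The Q-network only ever sees the "observation" entry, so its input spec is
# what the splitter returns, not the full dict.
q_net = q_network.QNetwork(observation_spec["observation"], action_spec)

agent = dqn_agent.DqnAgent(
    time_step_spec,
    action_spec,
    q_network=q_net,
    optimizer=tf.keras.optimizers.Adam(1e-3),
    observation_and_action_constraint_splitter=splitter_fn,
)
agent.initialize()
```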
-
I used the source and ran the program successfully.
But when I download a new ROM file from "http://atariage.com/company_page.html?CompanyID=1&SystemID=2600&SystemFilterID=2600"
and run, for example, "3d_ti…
-
I've been experimenting with ns3gym for an LTE scenario. In order to speed up learning, I would like to use the vectorized environment support provided in Stable-Baselines3. For this, I've tried the fo…
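In case it helps others, here is the rough pattern I'd expect for wrapping ns3gym in SubprocVecEnv, with one ns-3 instance per port; the ports, environment kwargs, and PPO settings below are placeholders, not the exact code from my setup:

```python
from ns3gym import ns3env
from stable_baselines3 import PPO
from stable_baselines3.common.vec_env import SubprocVecEnv

def make_ns3_env(port):
    # Each subprocess needs its own ns-3 simulation, so every copy of the
    # environment gets a distinct port (kwargs follow the usual ns3gym examples).
    def _init():
        return ns3env.Ns3Env(port=port, stepTime=0.5, startSim=True, simSeed=port)
    return _init

if __name__ == "__main__":
    ports = [5555, 5556, 5557, 5558]
    vec_env = SubprocVecEnv([make_ns3_env(p) for p in ports])
    model = PPO("MlpPolicy", vec_env, verbose=1)
    model.learn(total_timesteps=100_000)
    vec_env.close()
```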
-
- [x] I have marked all applicable categories:
+ [ ] exception-raising bug
+ [x] RL algorithm bug
+ [ ] documentation request (i.e. "X is missing from the documentation.")
+ [ ] ne…
-
https://arxiv.org/pdf/1703.01988.pdf
Deep reinforcement learning methods attain super-human performance in a wide range of environments. Such methods are grossly inefficient, often taking orders of…
-
Hello, I appreciate your repository, but I still have some questions. If I change the scheduler.go code to fit my algorithm, how do I deploy the scheduler so that it is used?
I'm new to this field. I ran "build.sh"…
-
Hi,
Thanks so much for the repo.
I have a problem: when the memory is >64, the script starts to train the NN model. This makes the robot keep using the last cmd_vel it was given. The cmd_vel onl…
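Not the repo's code, but a self-contained sketch of the pattern I'd look at here: hand the batch to a background thread so a slow training step can't freeze the loop that publishes cmd_vel. The stand-in model, memory, and publisher below are all placeholders:

```python
import queue
import random
import threading
import time

replay_memory = []              # placeholder for the real replay memory
train_queue = queue.Queue()

def train_step(batch):
    time.sleep(0.5)             # stands in for a slow NN update

def publish_cmd_vel(action):
    print("cmd_vel:", action)   # stands in for the real cmd_vel publisher

def training_worker():
    # Model updates run here, off the control loop, so the robot keeps getting
    # fresh cmd_vel messages even while a batch is being fitted.
    while True:
        train_step(train_queue.get())

threading.Thread(target=training_worker, daemon=True).start()

for step in range(200):
    action = random.uniform(-1.0, 1.0)       # stands in for the policy output
    publish_cmd_vel(action)
    replay_memory.append((step, action))
    if len(replay_memory) > 64:
        # Hand the batch off instead of training inline and blocking the loop.
        train_queue.put(list(replay_memory[-64:]))
    time.sleep(0.05)
```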
-
Insert a module after the loss computation of the algorithm that perturbs the parameters by doing behavioral cloning from an expert (the Garleanu and Pedersen solution).
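A minimal sketch of how such a module could sit after the loss computation, assuming a PyTorch-style setup: the RL loss is kept as-is and a behavioral-cloning term toward the expert action (e.g. the closed-form Garleanu-Pedersen rule) is added on top. The coefficient, tensors, and class name are placeholders, not an actual implementation:

```python
import torch
import torch.nn as nn

class BCPerturbation(nn.Module):
    """Hypothetical add-on: behavioral-cloning loss toward an expert policy."""

    def __init__(self, bc_coef=0.1):
        super().__init__()
        self.bc_coef = bc_coef

    def forward(self, rl_loss, policy_action, expert_action):
        # Pull the learned action toward the expert's without replacing
        # the algorithm's own objective.
        bc_loss = nn.functional.mse_loss(policy_action, expert_action)
        return rl_loss + self.bc_coef * bc_loss

# Usage sketch (all tensors are placeholders):
policy_action = torch.randn(32, 1, requires_grad=True)
expert_action = torch.randn(32, 1)          # stand-in for the expert solution
rl_loss = policy_action.pow(2).mean()       # stand-in for the algorithm's own loss

total_loss = BCPerturbation(bc_coef=0.1)(rl_loss, policy_action, expert_action)
total_loss.backward()                       # the extra term perturbs the parameters
```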
-
I found that when training rl-tcp in ns3gym, according to the definition, action = [new_ssThresh, new_cWnd], but there was no updated threshold in the training log for each step.
I haven't found the…
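For comparison, a bare-bones step loop (based on the usual ns3gym rl-tcp example, with placeholder values and kwargs) that logs both action components explicitly, so new_ssThresh shows up in the training log at every step:

```python
from ns3gym import ns3env

env = ns3env.Ns3Env(port=5555, stepTime=0.5, startSim=True, simSeed=0)
obs = env.reset()
done = False
step = 0

while not done:
    new_ssThresh = 2 ** 20          # placeholder; the real agent computes these
    new_cWnd = 10 * 1380
    action = [new_ssThresh, new_cWnd]

    obs, reward, done, info = env.step(action)
    step += 1
    # Log both components so the threshold is visible at every step.
    print("step={} new_ssThresh={} new_cWnd={} reward={}".format(
        step, new_ssThresh, new_cWnd, reward))

env.close()
```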