dqn-variants Search Results

63 results for dqn-variants


Farama-Foundation/HighwayEnv #89

Questions about results figures

@eleurent Hi eleurent, sorry to bother you again. I am not sure whether you received my question yesterday, so I want to try again. With your help, I can now train and test Highway-env with DQN. H…

narutoten520 updated 4 years ago · 10 comments

IntelLabs/coach #365

Memory leak in DQN variants

I have been having issues with processes dying from memory exhaustion. I instrumented the code to figure out the rate of consumption and got the following image: ![memory-profile](https://user-images.…

redknightlois updated 5 years ago · 1 comment

Farama-Foundation/ViZDoom #363

Using position info to do Deep Reinforcement Learning

Most of the time people use images as input to train a deep neural network to play Doom. Is anyone thinking about using real-valued info (position of player/medkit/enemy/ammo) as a vector to train a deep n…

acrushdjn updated 5 years ago · 3 comments

tensorforce/tensorforce #136

Question about PolicyGradientModel.reward_estimation

No matter whether a baseline is used, `PolicyGradientModel.reward_estimation` computes cumulative rewards in one batch by using `util.cumulative_discount` with `cumulative_start=0.0`. In my op…

0xSSoul updated 5 years ago · 6 comments

openai/coinrun #31

why the next_state never changes?

Please help me understand why the previous state is always equal to the next state. If that's the case, how will any NN work on the state? ``` import numpy as np from q_learning.utils import Sca…

Unimax updated 5 years ago · 4 comments

LuEE-C/PPO-Keras #2

Convergence

Hi, I have started using this on Lunar-Lander-v2, which is not continuous, and I do not yet see convergence. Did this code converge when you were testing? Also, for training these simple cases, did you …

aliostad updated 5 years ago · 31 comments

MillionIntegrals/vel #1

Implement policy gradient reinforcement learning algorithms

My next step is to have clean, working, and benchmarked policy gradient reinforcement learning algorithms.

MillionIntegrals updated 5 years ago · 7 comments

tensorforce/tensorforce #182

Feature Request: NoisyNet for exploration

The introduction of *NoisyNets* appears to add significant explorative capability to DQN & A3C, in some cases making progress on tasks that had otherwise exhibited little advancement. I wasn't sure…

ImpulseAdventure updated 6 years ago · 7 comments

ray-project/ray #3809

Thrift Install fails on Ubuntu 16

On the latest AWS DL AMI, I get strange errors trying to install ray from source. Not sure how to debug them: ``` Replacing /home/ubuntu/anaconda3/envs/tensorflow_p36/lib/python3.6/site-packages…

dmadeka updated 5 years ago · 6 comments

Kismuz/btgym #54

Using 5-,10-,30-minute data feed rather than 1-minute and th…

Hi, I spent some time looking over the guided_a3c example and the documentation, and reading the code for the BTgymRandomDataDomain class, and my question is: if I want to use pandas.resample() function and d…

ALevitskyy updated 5 years ago · 43 comments
