spragunr/deep_q_rl
Theano-based implementation of Deep Q-learning
BSD 3-Clause "New" or "Revised" License
1.08k stars
348 forks
issues
Error memory
#67
damienlancry
closed
6 years ago
1
ImportError: No module named cuda.var in theano version 0.9.0
#66
AnushaManila
closed
7 years ago
0
Dev
#65
kartikay94
opened
7 years ago
0
Illegal instruction (core dumped) when loading the ROM
#64
yanpanlau
opened
7 years ago
0
Update README.md
#63
gabriel-komaromy
opened
7 years ago
0
learning.csv doesn't contain average loss per epoch
#62
redsphinx
opened
8 years ago
0
We must choose the network when watching it play using the ale_run_watch.py script
#61
yzsatgithub
opened
8 years ago
1
Fixes to testing code.
#60
spragunr
closed
8 years ago
0
Issue on Input Shape of network
#59
taiharry108
opened
8 years ago
1
UnusedInputError
#58
uniwf2016
opened
8 years ago
0
ValueError: numpy.dtype has the wrong size, try recompiling
#57
xphongvn
opened
8 years ago
0
plot_results.py: IndexError: index 3 is out of bounds for axis 1 with size 2
#56
mw66
opened
8 years ago
0
Changed cPickle save/load files to binary.
#55
erdememekligil
closed
8 years ago
0
cPickle save/load pkl file on Windows
#54
erdememekligil
opened
8 years ago
0
Adding CPU versions of networks and scripts
#53
kmader
opened
8 years ago
0
Implementation of Double DQN
#52
corywalker
opened
8 years ago
12
ale-python-interface does not work
#51
wenzezhang
opened
8 years ago
1
something wrong with referenced Lasagne
#50
hutaocheng
opened
8 years ago
0
record identifying information with each experiment
#49
davidsj
opened
8 years ago
1
Performance improvements
#48
davidsj
closed
8 years ago
6
fix gradient being zeroed when diff is clipped
#47
davidsj
closed
8 years ago
2
Gradient zero when diff is clipped
#46
davidsj
closed
8 years ago
10
Refactor DataSet to use numpy arrays as circular buffers
#45
davidsj
closed
8 years ago
2
dump json parameters with each run
#44
omnivert
closed
8 years ago
1
Training on nips paper doesn't run
#43
lahwran
opened
9 years ago
0
Decoupling to increase reusability
#42
jleni
closed
9 years ago
0
Decoupling to increase reusability
#41
jleni
closed
9 years ago
3
"reward_per_epoch" should be "reward_per_episode" in results.csv
#40
davidsj
opened
9 years ago
1
Small bug fix for mean_q
#39
Ivanopolo
closed
9 years ago
0
Automatically using getScreenGrayscale (new) when available
#38
jleni
closed
9 years ago
2
Added deterministic options
#37
Ivanopolo
closed
9 years ago
1
Osx improvements
#36
udibr
closed
9 years ago
0
Decoupling to promote reusability
#35
jleni
closed
9 years ago
5
Training results don't match Deepmind implementation
#34
spragunr
closed
8 years ago
7
Reproducibility
#33
Ivanopolo
closed
9 years ago
4
Use the initialisation procedure used by DeepMind for testing
#32
alito
closed
9 years ago
3
Don't count unfinished games towards the average score for the epoch
#31
alito
closed
9 years ago
3
ROM file won't load
#30
arccoxx
closed
8 years ago
11
Memory error, sysmalloc: Assertion failed
#29
temporonni
closed
9 years ago
3
Fix OS X display screen (and README for how to install on OSX)
#28
udibr
closed
9 years ago
2
README and code fix for OS X
#27
udibr
closed
9 years ago
0
Minor ale_data_set.py fix to make it Mac-compatible
#26
Hiyorimi
closed
9 years ago
1
fixed compiling of pyx and pyximporting with proper numpy paths
#25
sin-mike
closed
9 years ago
1
c01b should be set to True only when dimshuffle is False.
#24
alito
closed
9 years ago
1
Do the initial collection of frames with epsilon set to whatever it i…
#23
alito
closed
9 years ago
0
Choose action with epsilon set to self.epsilon instead of 1.0 prior to hitting replay size
#22
alito
closed
9 years ago
2
Switch to Lasagne. Incorporate features from Nature paper.
#21
spragunr
closed
9 years ago
0
What is the memory requirement for this program
#20
gaoyuankidult
closed
9 years ago
3
Update cc_layers.py
#19
gaoyuankidult
closed
9 years ago
0
import host_from_gpu does not work with Theano 0.7.0
#18
bstadie
closed
9 years ago
1