-
- With small enough state and action space, we can use arrays and tables representations to approximate the value functions.
- But with large exponential state spaces for e.g. tetris with ```10^60```…
-
### 그냥 딥러닝 공부하다가 모르는것들 정리
### Q. What are the differences between 'epoch', 'batch', and 'minibatch'?
**A.** As far as I know, when adopting Stochastic Gradient Descent as learning algorithm,
…
-
Hi,
Is Deep Q Learning (Reinforcement Learning) supported by SciSharp? Because I can't find any example.
Thanks!
-
Good thing I kept all my research work private, already deep q networks code stolen.
Feel free to contact me if needed in cloudsim scheduling and energy part, I have worked on reinforcement learnin…
-
state, stacked_frames = stack_frames(stacked_frames, state, True)
File "doom_rl.py", line 80, in stack_frames
frame = preprocess_frame(state)
File "doom_rl.py", line 71, in preprocess_frame…
-
I've trained the model for 50 total episodes. However, when I run the last code cell, the action is always the same. I've printed Qs and the action, and the action is always [0 0 0 0 0 0 1 0]. The age…
-
Hi!
I’m trying to run SpaceInvaders, but faced problem with:” Game not found: Did you make sure to import the ROM?”. Then I tried solution by MaximusWudy, with point and renaiming files to .a26 (as a…
-
I tried testing the code and it crashes after the 500 episodes of training from the last part.
For `state = stack_frames(stacked_frames, frame)` did you forget a True or False at the end? And possibl…
-
Hey there! First of all: thank you so much for releasing this, documenting things, putting it on pypi, etc. etc., really appreciate it :)
I've been trying to get a fun "search and rescue" example wor…
-
I try to train Doom on my pc, and use the same code on the page.
But each time after I training it a while, it occur memory error in stack-frame process.
I check my memory usage while training, it k…