-
- With small enough state and action space, we can use arrays and tables representations to approximate the value functions.
- But with large exponential state spaces for e.g. tetris with ```10^60```…
-
### 그냥 딥러닝 공부하다가 모르는것들 정리
### Q. What are the differences between 'epoch', 'batch', and 'minibatch'?
**A.** As far as I know, when adopting Stochastic Gradient Descent as learning algorithm,
…
-
Good thing I kept all my research work private, already deep q networks code stolen.
Feel free to contact me if needed in cloudsim scheduling and energy part, I have worked on reinforcement learnin…
-
Hi,
Is Deep Q Learning (Reinforcement Learning) supported by SciSharp? Because I can't find any example.
Thanks!
-
state, stacked_frames = stack_frames(stacked_frames, state, True)
File "doom_rl.py", line 80, in stack_frames
frame = preprocess_frame(state)
File "doom_rl.py", line 71, in preprocess_frame…
-
I've trained the model for 50 total episodes. However, when I run the last code cell, the action is always the same. I've printed Qs and the action, and the action is always [0 0 0 0 0 0 1 0]. The age…
-
Hey together,
I am currently working on a university project where the goal is to develop a reinforcement learning agent which controls the traffic light states.
I heard that SUMO is the most appr…
-
Hi!
I’m trying to run SpaceInvaders, but faced problem with:” Game not found: Did you make sure to import the ROM?”. Then I tried solution by MaximusWudy, with point and renaiming files to .a26 (as a…
-
I tried testing the code and it crashes after the 500 episodes of training from the last part.
For `state = stack_frames(stacked_frames, frame)` did you forget a True or False at the end? And possibl…
-
Hey there! First of all: thank you so much for releasing this, documenting things, putting it on pypi, etc. etc., really appreciate it :)
I've been trying to get a fun "search and rescue" example wor…