-
Hi Philip, I was wondering whether it's possible to manually set the emulator speed. It'd be nice to further increase the speed, say to 5000%, during training. Additionally, when demoing the RL agent,…
-
Hi Denny,
Again, I do appreciate your work!
I was thinking of implementing DQN with the **Dyna-Q** algorithm, where **Q(s,a)** is updated not only by **real** experience but also by **simulated** ex…
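For reference, here is a tabular sketch of the Dyna-Q loop being described (toy environment interface, state/action counts, and hyperparameters are all made up for illustration; a DQN version would replace the table with a network):

```python
import random
from collections import defaultdict

def dyna_q(env_step, n_states, n_actions, episodes=50, planning_steps=10,
           alpha=0.1, gamma=0.95, eps=0.2, seed=0):
    """Tabular Dyna-Q: each real transition trains a learned model,
    and the model then generates simulated transitions for extra Q updates."""
    rng = random.Random(seed)
    Q = defaultdict(float)   # Q[(s, a)]
    model = {}               # model[(s, a)] = (reward, next_state, done)
    for _ in range(episodes):
        s, done = 0, False
        while not done:
            # epsilon-greedy action selection
            if rng.random() < eps:
                a = rng.randrange(n_actions)
            else:
                a = max(range(n_actions), key=lambda a_: Q[(s, a_)])
            r, s2, done = env_step(s, a)
            # (1) direct RL update from the *real* transition
            target = r + (0.0 if done else
                          gamma * max(Q[(s2, a_)] for a_ in range(n_actions)))
            Q[(s, a)] += alpha * (target - Q[(s, a)])
            # (2) model learning: remember what this (s, a) did
            model[(s, a)] = (r, s2, done)
            # (3) planning: extra updates from *simulated* transitions
            for _ in range(planning_steps):
                (ps, pa), (pr, ps2, pdone) = rng.choice(list(model.items()))
                ptarget = pr + (0.0 if pdone else
                                gamma * max(Q[(ps2, a_)] for a_ in range(n_actions)))
                Q[(ps, pa)] += alpha * (ptarget - Q[(ps, pa)])
            s = s2
    return Q
```

The planning loop in step (3) is exactly the "simulated experience" part: it reuses the same Q-update rule, just on transitions drawn from the learned model instead of the environment.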
-
Hi,
Why did you include the minus sign in the `grad_ys` argument of the function below?
`self.parameters_gradients = tf.gradients(self.action_output,self.parameters,-self.q_gradient_input/BATCH_SIZ…
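For what it's worth, the usual explanation is that `tf.gradients(ys, xs, grad_ys)` returns the vector-Jacobian product of `grad_ys` with `d ys / d xs`, and since optimizers *descend* along the gradients you hand them, negating the critic's `dQ/da` turns that descent into *ascent* on Q. A minimal plain-Python illustration of the sign convention (the actor and critic here are hypothetical toys, not the repo's networks):

```python
def actor(theta):            # hypothetical one-parameter "policy": a = 2*theta
    return theta * 2.0

def dQ_da(a):                # gradient of a toy critic Q(a) = -(a - 3)^2
    return -2.0 * (a - 3.0)  # Q is maximized at a = 3

theta, lr = 0.0, 0.05
for _ in range(100):
    a = actor(theta)
    q_grad = dQ_da(a)                   # plays the role of self.q_gradient_input
    da_dtheta = 2.0                     # Jacobian of the actor wrt theta
    param_grad = da_dtheta * (-q_grad)  # like tf.gradients(a, theta, -q_grad)
    theta -= lr * param_grad            # optimizer's *descent* step => ascent on Q
# theta converges so that actor(theta) approaches 3, the critic's maximum
```

Without the minus sign, the same descent step would drive the actor toward the critic's *minimum* instead.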
-
Setting up openai/universe, I used the "universe starter agent" as a smoke test.
After adjusting the number of workers to better utilize my CPU, I saw the default PongDeterministic-v3 start winnin…
-
Could you pinpoint the code where the actor's parameters (weights) are updated?
I am particularly looking for the step where the gradient of the critic is calculated with respect to the action variables, and that of the actor wrt …
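For context, the two-step chain rule being asked about can be sketched in NumPy (a hypothetical linear actor and toy critic, purely to show where each gradient enters; the real code does this through TF ops):

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical linear actor a = s @ W (state dim 4, action dim 2)
W = rng.normal(size=(4, 2)) * 0.1
S = rng.normal(size=(32, 4))          # a batch of states

def critic_grad_wrt_action(S, A):
    # Toy critic Q(s, a) = -||a - s[:, :2]||^2; its gradient wrt a:
    return -2.0 * (A - S[:, :2])

for _ in range(200):
    A = S @ W                              # actor forward pass
    dQ_dA = critic_grad_wrt_action(S, A)   # step 1: critic grad wrt *actions*
    # step 2: chain rule through the actor's parameters, batch-averaged
    # (ascent direction on Q): dJ/dW = S^T (dQ/dA) / batch_size
    dJ_dW = S.T @ dQ_dA / len(S)
    W += 0.1 * dJ_dW                       # gradient *ascent* on Q
```

Step 1 is the critic's gradient with respect to the action inputs, and step 2 backpropagates that vector through the actor to reach its weights.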
-
Greetings!
We have just open-sourced a Lasagne-based library for reinforcement-learning algorithm design.
- The repo's here: [AgentNet](https://github.com/yandexdatas…
-
## Configuration:
- Untuned: default parameters; 10 repeats, with the median stability score taken.
- Tuned: 300 evaluations, run on Spark.
## Results:
- Delta: positive means tuning is better.
- Different Combi…
-
Great project! I'm looking to use this with a Kinect v2 camera for a robotics application. I have 26 different joints each with x,y,z coordinates that will be my state space. Looking through the code …
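If it helps, assembling that state space is mostly a flattening step: 26 joints times (x, y, z) gives a 78-dimensional state vector. A small sketch (the joint count comes from the post above; the root-joint choice and normalization are assumptions, e.g. making the state invariant to where the person stands in the camera frame):

```python
import numpy as np

N_JOINTS = 26  # joint count from the Kinect setup described above

def joints_to_state(joints, root=0):
    """Flatten N_JOINTS (x, y, z) joint positions into one state vector.

    Expressing all joints relative to a root joint (hypothetical choice:
    index 0, e.g. the spine base) removes the absolute camera-frame
    position from the state.
    """
    joints = np.asarray(joints, dtype=np.float32)
    assert joints.shape == (N_JOINTS, 3)
    relative = joints - joints[root]   # translate so the root is the origin
    return relative.reshape(-1)        # shape (78,)
```

The resulting 78-vector can then be fed directly as the observation to the agent.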
-
Please find below the feedback on your work from the 1st iteration.
Wiki
Homepage (Vision)
- Limitations can be set in order to define the purpose of the application more specifically, e.g., it can b…