machanic opened this issue 7 years ago
Sorry for the late answer; I've been very busy these days.
The separation means 'not sharing gradients': in this code, you can see that the gradients of the REINFORCE part flow only through the location & baseline networks.
> your code seems just stop_gradient sampled_locs and not stop_gradient in mean_locs?

That is intentional: only `sampled_locs` is wrapped in `stop_gradient`, not `mean_locs`. This is already how the original RAM implementation does it, and I think it is correct; the sketch below shows why.
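To make this concrete, here is a minimal sketch of the sampling pattern, assuming TF 1.x graph mode and hypothetical shapes (the `mean_locs` variable stands in for the location network's output, and `loc_std` is an assumed sampling std-dev): because the sample is wrapped in `stop_gradient`, the only differentiable path into the log-density is through `mean_locs`, which is exactly the REINFORCE gradient.

```python
import tensorflow as tf  # TF 1.x graph mode, matching the code quoted below

loc_std = 0.11  # hypothetical sampling std-dev

# Stand-in for the location network's output, for illustration only.
mean_locs = tf.get_variable("mean_locs", shape=[4, 2])

# The sample is wrapped in stop_gradient, so it is treated as data.
noise = tf.random_normal(tf.shape(mean_locs), stddev=loc_std)
sampled_locs = tf.stop_gradient(mean_locs + noise)

# Log-density of the sample under N(mean_locs, loc_std^2), up to a constant.
log_p_loc = -tf.square(sampled_locs - mean_locs) / (2.0 * loc_std ** 2)

# A nonzero gradient (sampled_locs - mean_locs) / loc_std^2 reaches mean_locs:
# this is the REINFORCE score that trains the location network.
grad, = tf.gradients(tf.reduce_sum(log_p_loc), [mean_locs])
```

Stopping the gradient on `mean_locs` as well would zero this gradient, and the location network could never learn.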
> your seperate 2 parts is not seen from the your code?

Please look at the code carefully. You can see that the gradient flow is separated between the (location, baseline) networks and the (glimpse, core) networks.
I'll give you a more detailed answer as soon as possible...
The original question from machanic:

This is the loss function:

```python
J = tf.concat(values=[tf.log(p_y + SMALL_NUM) * (onehot_labels_placeholder),
                      tf.log(p_loc + SMALL_NUM) * (R - no_grad_b)],
              axis=1)  # axis=1 actually concatenates columns
```

`p_loc` is made by:

```python
p_loc = gaussian_pdf(mean_locs, sampled_locs)  # ?? mean_locs is not stop_gradient'ed, but sampled_locs is
```

Your code seems to only `stop_gradient` `sampled_locs` and not `mean_locs`? And your separation into 2 parts is not visible from your code?
1. Location network, baseline network: learn from the reinforcement-learning (REINFORCE) gradients only.
2. Glimpse network, core network: learn from the supervised-learning gradients only. (A sketch of this wiring follows below.)
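A minimal sketch of this two-way separation, assuming TF 1.x and hypothetical names and shapes (`loc_net`/`baseline_net`/`classifier` are illustrative `tf.layers.dense` stand-ins; in the real model `h` would come from the core RNN rather than a placeholder):

```python
import tensorflow as tf  # TF 1.x graph mode, as in the loss quoted above

batch_size, hidden_size, n_classes, loc_dim = 32, 256, 10, 2
loc_std, SMALL_NUM = 0.11, 1e-10

# In the real model h is the core RNN state; a placeholder keeps the sketch small.
h = tf.placeholder(tf.float32, [batch_size, hidden_size])
onehot_labels_placeholder = tf.placeholder(tf.float32, [batch_size, n_classes])
R = tf.placeholder(tf.float32, [batch_size, 1])  # reward, e.g. 1 if classified correctly

# Part 1: the location & baseline nets read a *stopped* copy of h, so the
# REINFORCE gradients below cannot leak into the core/glimpse networks.
h_stopped = tf.stop_gradient(h)
mean_locs = tf.layers.dense(h_stopped, loc_dim, activation=tf.tanh, name="loc_net")
baseline = tf.layers.dense(h_stopped, 1, name="baseline_net")
no_grad_b = tf.stop_gradient(baseline)  # baseline is a constant inside the REINFORCE term

# The sampled location is a constant; only mean_locs is differentiable in p_loc.
sampled_locs = tf.stop_gradient(
    mean_locs + tf.random_normal([batch_size, loc_dim], stddev=loc_std))

def gaussian_pdf(mean, sample):
    z = (sample - mean) / loc_std
    return tf.exp(-0.5 * tf.square(z)) / (loc_std * tf.sqrt(2.0 * 3.1415926))

p_loc = tf.reduce_prod(gaussian_pdf(mean_locs, sampled_locs), axis=1, keepdims=True)

# Part 2: the classifier head reads the *unstopped* h, so supervised gradients
# train the core/glimpse networks (and the classifier) only.
p_y = tf.nn.softmax(tf.layers.dense(h, n_classes, name="classifier"))

# Hybrid objective, column-concatenated exactly as in the quoted loss.
J = tf.concat(values=[tf.log(p_y + SMALL_NUM) * onehot_labels_placeholder,
                      tf.log(p_loc + SMALL_NUM) * (R - no_grad_b)], axis=1)
cost = -tf.reduce_mean(tf.reduce_sum(J, axis=1))

# The baseline itself is trained toward the reward, outside the REINFORCE term.
baseline_mse = tf.reduce_mean(tf.square(R - baseline))
```

With this wiring, applying an optimizer to `cost + baseline_mse` updates each sub-network only from the term that can reach it through the graph, which is the separation described in points 1 and 2.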