thushv89 / AdaCNN

AdaCNN algorithm. Clean implementation
0 stars 0 forks source link

Major Bug: In updating target network weights #31

Closed thushv89 closed 6 years ago

thushv89 commented 6 years ago

The weights were being updated wrongly. If w is the weights of actor/critic and w' are the weights of target actor/critic, I was doing w' <- w' tau + (1-tau) w' instead of w' <- w tau + (1-tau) w'