9p15p opened this issue 4 years ago
@9p15p Yes, the memory also requires gradients. Specifically, we learn how to encode a memory from [frame, mask]. Naturally, every feed-forward operation is done without `torch.no_grad` during training.
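As a minimal illustration of that point (the method names `encode_memory` and `segment` below are hypothetical placeholders, not this repo's actual API): the memory is computed inside the autograd graph during training, and `torch.no_grad()` would only wrap inference.

```python
import torch

# Hypothetical method names, for illustration only.
key, value = model.encode_memory(frame, mask)   # training: gradients flow through this
pred = model.segment(next_frame, (key, value))
loss = criterion(pred, gt_mask)
loss.backward()                                 # also updates the memory encoder

with torch.no_grad():                           # inference only: no autograd graph is built
    key, value = model.encode_memory(frame, mask)
```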
Thank you for your reply. I have another question: how should we backpropagate the loss? In other words, where should we put `loss.backward()`? I can think of 3 strategies:
1. Calculate the loss and call `loss.backward()` for every object in every frame. (In my experiment this means calling it with `retain_graph=True` for the last two frames' losses, and then, after both frames' losses have been computed, one more call without `retain_graph=True`; see the rough sketch below.)
2. Calculate the loss for every object in every frame, but only call `loss.backward()` once, after all frames' losses have been computed.
3. Only calculate the loss for the final (third) mask, ignore the middle (second) mask, and call `loss.backward()` once at the end.
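A rough sketch of strategy 1 in PyTorch, where every name (`model`, `criterion`, `optimizer`, `frames`, `gt_masks`, `num_frames`, and the `encode_memory`/`segment`/`update_memory` methods) is a placeholder rather than this repo's actual code:

```python
import torch

# Strategy 1 sketch: backward per frame, retaining the graph for all but the
# final call, because the memory's graph is shared by later frames.
optimizer.zero_grad()
memory = model.encode_memory(frames[0], gt_masks[0])        # reference frame
for t in range(1, num_frames):
    pred = model.segment(frames[t], memory)                  # uses the memory built so far
    loss = criterion(pred, gt_masks[t])
    loss.backward(retain_graph=(t < num_frames - 1))         # keep graph until the last backward
    memory = model.update_memory(memory, frames[t], pred)    # extend memory for the next frame
optimizer.step()
```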
Maybe none of these three strategies is right. Maybe my inference ("inf") procedure is improper and we should train all objects at the same time.
Looking forward to your advice! Thank you!
What I did is option 2. We sum up all the losses and call `backward()` and `step()` at the end of the iteration.
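For concreteness, a minimal sketch of that pattern with the same placeholder names as above (not the actual training code):

```python
import torch

# Option 2 sketch: accumulate all per-frame losses, then a single
# backward()/step() at the end of the iteration.
optimizer.zero_grad()
memory = model.encode_memory(frames[0], gt_masks[0])
total_loss = 0.0
for t in range(1, num_frames):
    pred = model.segment(frames[t], memory)
    total_loss = total_loss + criterion(pred, gt_masks[t])
    memory = model.update_memory(memory, frames[t], pred)
total_loss.backward()    # one backward over the summed loss, no retain_graph needed
optimizer.step()
```

Summing the losses and calling `backward()` once produces the same gradients as calling `backward()` on each loss separately (gradients accumulate across calls), but it avoids keeping the graph alive with `retain_graph=True`.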
Thank you, sir. My best reimplementation is still not satisfactory, but in my own experiments, calling `loss.backward()` every frame gives better performance and converges more quickly (using `retain_graph=True` for the second frame's loss). I will try again.
Best wishes!
Hi, sir! Thank you for your fine work, but I still have some questions.