I read your code. I have some questions about the code details.
When using the ST-RNN, the masks are input into GRU as a parameter. Could you explain why you did this? What is the function of masks input?
The masks are the same as "done" in the gym environment. If an episode is done, we use masks to clear the hidden states of GRU to all zeros. See this line.
Hi,
I read your code. I have some questions about the code details. When using the ST-RNN, the masks are input into GRU as a parameter. Could you explain why you did this? What is the function of masks input?
Thanks!