Closed nianniana closed 3 years ago
Hi there,
Thanks for your interest! Have a look at the following code pointers:
(N=batch size, T=number of frames / number of windows, C=number of feature channels, V=number of nodes) This would be easier to understand if you try tracing the dimensions of the tensors. Hope this helps.
i am intereted in your great work. However, in the network architecture figure in the paper, i am confused about the detailed operation of "collapse window reshape and fc", would you please explain the procedure in detail. Thanks a lot!