Open liangchengwu opened 5 years ago
What are the input data modalities for two streams? Depth maps and Skeleton? or RGB and optical flow?
For the output of two streams, did you do the fusion at a fully connected layer and then fed them to softmax for label prediction?
Thanks!
Sorry, the reply is too late. The input is the 3D skeleton data of NTU RGB+D, and the data of the two streams is spliced and sent to the fully connected layer after the feature extraction.
What are the input data modalities for two streams? Depth maps and Skeleton? or RGB and optical flow?
For the output of two streams, did you do the fusion at a fully connected layer and then fed them to softmax for label prediction?
Thanks!