Closed lucasjinreal closed 2 years ago
Thanks for your interest in this work. Reshaping the feature of the 3D stage from [B, N, C] back to [B, C, H, W] format should work. The detection and segmentation codes are released, please refer to the implementations.
HI, if I look it right, the 4 stages output shape would be something like this:
now that the stage5 is 3 dimension, how should I using these output features to a FPN-like detection architecture?