Hi there, I was wondering whether the Mask2FormerOccHead can be adapted to use a simple per-voxel loss such as CE (or Geo-Scal loss from MonoScene). I find that using mask and cls paradigm in Mask2Former is kinda difficult to apply known loss function in voxel space. Can you think of a way to adapt this in your code? Thank you
Hi there, I was wondering whether the Mask2FormerOccHead can be adapted to use a simple per-voxel loss such as CE (or Geo-Scal loss from MonoScene). I find that using mask and cls paradigm in Mask2Former is kinda difficult to apply known loss function in voxel space. Can you think of a way to adapt this in your code? Thank you