I am trying to generate some visuals with ViewFormer on CO3Dv2 and I would like to double check a few things.
The changes that I know from v1 to v2 are:
1) the input image is now 4 channels, with the first 3 being masked rgb with black background, and the last channel being a binary mask.
However, I am getting very different results using the same code but with different models.
The first gif is rendered using co3d-10cat-noloc-transformer-tf while the second gif is rendered using co3dv2-all-noloc-transformer-tf.
The first gif looks reasonable but the second gif looks suspicious.
It would be great if you can provide some pointers for me to debug this. Thank you so much!
I am trying to generate some visuals with ViewFormer on CO3Dv2 and I would like to double check a few things.
The changes that I know from v1 to v2 are: 1) the input image is now 4 channels, with the first 3 being masked rgb with black background, and the last channel being a binary mask.
However, I am getting very different results using the same code but with different models.
The first gif is rendered using
co3d-10cat-noloc-transformer-tf
while the second gif is rendered usingco3dv2-all-noloc-transformer-tf
.The first gif looks reasonable but the second gif looks suspicious.
It would be great if you can provide some pointers for me to debug this. Thank you so much!