Thank you for the excellent work. I have a question regarding the preprocessing step. Could you please explain why the input image needs to be cropped before being fed into the pipeline? Is this related to how the model was trained?
I experimented with using the original, uncropped image, but the generated results were not as good. Specifically, when feeding the original image into the pipeline, the side views often have a 'long tail' behind the object.
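Because the training data is scaled to fit the [-1, 1] box, the model implicitly learns that the object reaches the box boundary along at least one axis: either its left and right extents touch the box, or its front and back do. With an uncropped image the object no longer fills the frame from left to right, so the model likely stretches it front to back instead, which would explain the 'long tail' in the side views.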
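For illustration, here is a minimal sketch of that kind of cropping, assuming an RGBA input whose alpha channel marks the object. The names `crop_to_foreground` and `fill_ratio` are hypothetical and this is not the repository's actual preprocessing code; it only shows the idea of making the object span most of a square frame, as it would after being scaled into the box.

```python
import numpy as np


def crop_to_foreground(rgba: np.ndarray, fill_ratio: float = 0.9) -> np.ndarray:
    """Crop an H x W x 4 image so the object spans `fill_ratio` of a square frame.

    Assumption: pixels with alpha > 0 belong to the object.
    """
    alpha = rgba[..., 3]
    ys, xs = np.nonzero(alpha > 0)
    if ys.size == 0:
        raise ValueError("image has no foreground pixels")

    # Tight bounding box around the object.
    y0, y1 = ys.min(), ys.max() + 1
    x0, x1 = xs.min(), xs.max() + 1
    obj = rgba[y0:y1, x0:x1]
    h, w = obj.shape[:2]

    # Square canvas sized so the longer object side covers `fill_ratio` of it.
    side = int(np.ceil(max(h, w) / fill_ratio))
    canvas = np.zeros((side, side, 4), dtype=rgba.dtype)

    # Paste the object centered on the transparent canvas.
    top = (side - h) // 2
    left = (side - w) // 2
    canvas[top:top + h, left:left + w] = obj
    return canvas
```

The result can then be resized to whatever resolution the pipeline expects. A `fill_ratio` close to 1 makes the object nearly touch the frame edges; the right value depends on how the training renders were framed, which is not stated in this thread.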