Thanks for the good research. I have a custom dataset for object detection, but not enough data for fine-tuning. I wonder whether DA-Fusion can be applied here. Even if the semantic information is preserved, I think the ground-truth bounding box coordinates will not map accurately onto the generated image. What do you think?