Open seon-creator opened 10 months ago
The test-time transform called BottomupResize
cannot guarantee that the input image will be square. Instead, it typically resizes the image so that its shortest edge is the same length as the specified width and height of input_size
. If the width and height of input_size
are different, it could cause problems in this process based on the following code snippet.
https://github.com/open-mmlab/mmpose/blob/efe09cd5268d4d6b21100334fbf2947ef36fc7db/mmpose/datasets/transforms/bottomup_transforms.py#L521-L530
Thank you for your answer!
The test-time transform called
BottomupResize
cannot guarantee that the input image will be square. Instead, it typically resizes the image so that its shortest edge is the same length as the specified width and height ofinput_size
. If the width and height ofinput_size
are different, it could cause problems in this process based on the following code snippet.
So, is there any way to change which part of the code makes it possible to get good test results even with rectangular input sizes? My custom dataset is 1920×1080 resolution and the test results for the DEKR model are poor.
So, is there any way to change which part of the code makes it possible to get good test results even with rectangular input sizes? My custom dataset is 1920×1080 resolution and the test results for the DEKR model are poor.
You can try to set the input_size
to (1080, 1080). During training, the image will be randomly resized and cropped to 1080x1080. During inference, the image will be resized to 1920x1080.
So, is there any way to change which part of the code makes it possible to get good test results even with rectangular input sizes? My custom dataset is 1920×1080 resolution and the test results for the DEKR model are poor.
You can try to set the
input_size
to (1080, 1080). During training, the image will be randomly resized and cropped to 1080x1080. During inference, the image will be resized to 1920x1080.
Thank you very much! I will try it.
Prerequisite
Environment
mmcv 2.0.1 mmdet 3.0.0 mmengine 0.8.4 mmpose 1.1.0 mmpretrain 1.0.2
Reproduces the problem - code sample
Reproduces the problem - command or script
python tools/train.py configs/body_2d_keypoint/dekr/custom_coco/dekr_hrnet-w32_8xb10-140e_coco-256x192.py
Reproduces the problem - error message
The train.py code works with not error, but after finishing train when I use the weight to infer, the performance was worst. Compare to square input, the keypoint detection didn't work good. When I use this model with default input size 256x256, this model works well, but I have a question. Why the DEKR, bottomupPoseEstimation method didn't work well with rectangle input size? Unlike Top down models, why bottomupPoseEstimation model didn't works well with 256x192 or other rectangular resize?
Additional information
No response