Closed Chttan closed 2 years ago
I am not very clear with this "However, there are many frames where there are more than 3 keypoints being detected and drawn, sometimes up to a total of 6. The additional keypoints are usually adjacent to the existing keypoints." I am sorry but I can't imagine it in my mind. I think it will be helpful, if you can post some example images here.
BTW, I will recommend training a top-down model along with a detection model in this case.
Hello,
Thank you for the timely response!
About the first question: the dataset is too small, I think it is not sufficient to train a good bottom-up model. As it always requires more data to achieve good performance.
What size of a dataset would you suggest to achieve good results for a bottom up model?
I am not very clear with this "However, there are many frames where there are more than 3 keypoints being detected and drawn, sometimes up to a total of 6. The additional keypoints are usually adjacent to the existing keypoints." I am sorry but I can't imagine it in my mind. I think it will be helpful, if you can post some example images here.
Please find a representative image illustrating the problem I am facing:
My skeleton is defined as only 3 keypoints, however, more are being detected in some frames. To me it seems likely that 2 skeletons are being drawn. The additional keypoints/skeleton appears in about 1/3 of the inferred frames and I am not sure what might cause this.
About the assertionError. This is because bottom-up models do not support batchsize>1 for inference. You may simply set the batchsize=1 in the config during inference.
I will give this a try, thank you!
BTW, I will recommend training a top-down model along with a detection model in this case.
I have given this a try, but am not having luck with the detection model. I will take another shot at it.
Thank you again for your help!
Sorry for the late reply. Could you please print the predicted poses?
I assume that the model detects two sets of poses. And you can easily keep one and discard the other.
Have you tried lower nms threshold?
https://github.com/open-mmlab/mmpose/blob/4297d1e932b2b346af71de9f98e3774ae0b66aec/demo/bottom_up_img_demo.py#L41
Hello, my apologies for the delay in replying.
Could you please print the predicted poses? I assume that the model detects two sets of poses. And you can easily keep one and discard the other.
Yes, this is what it seems to have been doing. For one way to resolve my issue, I did try filtering out one of the detected poses.
Have you tried lower nms threshold?
Setting nms threshold to 0.5 also filtered out the extra skeleton.
The third method I used was to set max_num_people to 1, but this was less than ideal as it removes the possibility to track more than one animal.
Thank you for all of your help and suggestions!
Greetings,
First, thank you for your efforts and impressive project.
I am trying to train a bottom-up model to detect a single animal using a custom dataset. I have around 600 annotated images, with 3 keypoints and bounding boxes. In the images, there are typically 2 animals, one with explicit markings, one without.
I am training the model for 400 Epochs and inferring using
./demo/bottom_up_video_demo.py
Looking at the output video, the correct animal is being tracked, with the 3 keypoints generally near the desired locations. However, there are many frames where there are more than 3 keypoints being detected and drawn, sometimes up to a total of 6. The additional keypoints are usually adjacent to the existing keypoints. I am having trouble understanding what might cause this and what changes I should make when training to resolve this issue. Could you offer some help please?
Please find my environment, collected using
./utils/collect_env.py
:I train using
./tools/train.py
. I am unable to provide complete training output logs, as for some reason the output does not show epoch progression:Additionally, during the evaluation step, I receive an error:
Both of these training issues do not appear when training a top down model.
I have tried some of the suggestions found in issue #347 to improve my bottom up model, but still no luck. Do you have any suggestions about what I should try next?
Thank you!