quanghuy0497 / Mask_R-CNN

A modified version of Mask R-CNN based on Matterport's version. Featuring: polygon annotating mask generation and k-fold cross-validation training.
15 stars 7 forks source link

Testing stage #2

Closed Nawaffarhan closed 2 years ago

Nawaffarhan commented 2 years ago

I am confused in training part using 28x28 for compute loss function. In the evaluation or testing part to compute AP used the 28x28 after that up sampling with therthold 0.5 or upsampling to compute overlap??

quanghuy0497 commented 2 years ago

Your question to me is a little confusing, so I encourage you should go to the original repository Matterport/Mask_RCNN looking for the answer. If you want to imply the threshold of 0.5 when computing AP, that is just the IoU threshold to determine if the predicted mask/bounding box is correct with the ground truth (see how AP is computed for further detail)

Nawaffarhan commented 2 years ago

My question is when evaluate iou mask for evaluate AP with different therthold. The predict mask is 28x28 to evaluate AP or resize to original size of roi to evaluate AP

On Sun, May 1, 2022, 4:26 PM Quang-Huy Nguyễn @.***> wrote:

Closed #2 https://github.com/quanghuy0497/Mask_R-CNN/issues/2.

— Reply to this email directly, view it on GitHub https://github.com/quanghuy0497/Mask_R-CNN/issues/2#event-6529523856, or unsubscribe https://github.com/notifications/unsubscribe-auth/AJOADSSTGXQ5QUBEEO72DOTVH2BBVANCNFSM5UZNQIKA . You are receiving this because you authored the thread.Message ID: @.***>

quanghuy0497 commented 2 years ago

I believe the predicted mask is evaluated AP with the size of 28x28.

Vào 20:32, CN, 1 thg 5, 2022 Nawaffarhan @.***> đã viết:

My question is when evaluate iou mask for evaluate AP with different therthold. The predict mask is 28x28 to evaluate AP or resize to original size of roi to evaluate AP

On Sun, May 1, 2022, 4:26 PM Quang-Huy Nguyễn @.***> wrote:

Closed #2 https://github.com/quanghuy0497/Mask_R-CNN/issues/2.

— Reply to this email directly, view it on GitHub https://github.com/quanghuy0497/Mask_R-CNN/issues/2#event-6529523856, or unsubscribe < https://github.com/notifications/unsubscribe-auth/AJOADSSTGXQ5QUBEEO72DOTVH2BBVANCNFSM5UZNQIKA

. You are receiving this because you authored the thread.Message ID: @.***>

— Reply to this email directly, view it on GitHub https://github.com/quanghuy0497/Mask_R-CNN/issues/2#issuecomment-1114240155, or unsubscribe https://github.com/notifications/unsubscribe-auth/AJBSUHPKWXEX5JMWOW5BON3VH2BXRANCNFSM5UZNQIKA . You are receiving this because you modified the open/close state.Message ID: @.***>

Nawaffarhan commented 2 years ago

Thank you for the clarification.

On Sun, May 1, 2022, 4:39 PM Quang-Huy Nguyễn @.***> wrote:

I believe the predicted mask is evaluated AP with the size of 28x28.

Vào 20:32, CN, 1 thg 5, 2022 Nawaffarhan @.***> đã viết:

My question is when evaluate iou mask for evaluate AP with different therthold. The predict mask is 28x28 to evaluate AP or resize to original size of roi to evaluate AP

On Sun, May 1, 2022, 4:26 PM Quang-Huy Nguyễn @.***> wrote:

Closed #2 https://github.com/quanghuy0497/Mask_R-CNN/issues/2.

— Reply to this email directly, view it on GitHub <https://github.com/quanghuy0497/Mask_R-CNN/issues/2#event-6529523856 , or unsubscribe <

https://github.com/notifications/unsubscribe-auth/AJOADSSTGXQ5QUBEEO72DOTVH2BBVANCNFSM5UZNQIKA

. You are receiving this because you authored the thread.Message ID: @.***>

— Reply to this email directly, view it on GitHub < https://github.com/quanghuy0497/Mask_R-CNN/issues/2#issuecomment-1114240155 , or unsubscribe < https://github.com/notifications/unsubscribe-auth/AJBSUHPKWXEX5JMWOW5BON3VH2BXRANCNFSM5UZNQIKA

. You are receiving this because you modified the open/close state.Message ID: @.***>

— Reply to this email directly, view it on GitHub https://github.com/quanghuy0497/Mask_R-CNN/issues/2#issuecomment-1114241482, or unsubscribe https://github.com/notifications/unsubscribe-auth/AJOADSX4W7AAZD4Q7NUSRHTVH2CR3ANCNFSM5UZNQIKA . You are receiving this because you authored the thread.Message ID: @.***>

Nawaffarhan commented 2 years ago

Please I has questions could you please help me.

The question is if used 1024x1024 image size and finally has RoI is 8x8

and fixed mask feature is 14x14, how could convert small size of feature map to bigger size?

On Sun, May 1, 2022, 4:40 PM suhila owida @.***> wrote:

Thank you for the clarification.

On Sun, May 1, 2022, 4:39 PM Quang-Huy Nguyễn @.***> wrote:

I believe the predicted mask is evaluated AP with the size of 28x28.

Vào 20:32, CN, 1 thg 5, 2022 Nawaffarhan @.***> đã viết:

My question is when evaluate iou mask for evaluate AP with different therthold. The predict mask is 28x28 to evaluate AP or resize to original size of roi to evaluate AP

On Sun, May 1, 2022, 4:26 PM Quang-Huy Nguyễn @.***> wrote:

Closed #2 https://github.com/quanghuy0497/Mask_R-CNN/issues/2.

— Reply to this email directly, view it on GitHub <https://github.com/quanghuy0497/Mask_R-CNN/issues/2#event-6529523856 , or unsubscribe <

https://github.com/notifications/unsubscribe-auth/AJOADSSTGXQ5QUBEEO72DOTVH2BBVANCNFSM5UZNQIKA

. You are receiving this because you authored the thread.Message ID: @.***>

— Reply to this email directly, view it on GitHub < https://github.com/quanghuy0497/Mask_R-CNN/issues/2#issuecomment-1114240155 , or unsubscribe < https://github.com/notifications/unsubscribe-auth/AJBSUHPKWXEX5JMWOW5BON3VH2BXRANCNFSM5UZNQIKA

. You are receiving this because you modified the open/close state.Message ID: @.***>

— Reply to this email directly, view it on GitHub https://github.com/quanghuy0497/Mask_R-CNN/issues/2#issuecomment-1114241482, or unsubscribe https://github.com/notifications/unsubscribe-auth/AJOADSX4W7AAZD4Q7NUSRHTVH2CR3ANCNFSM5UZNQIKA . You are receiving this because you authored the thread.Message ID: @.***>

quanghuy0497 commented 2 years ago

Dear Nawaffahan,

For "downsampling" a feature matrix (e.g. from 512x512 to 32x32) we normally use Convolutional layer with Convolutional Operation, which is a basic of CNN.

On the otherhand, for "upsampling" a feature matrix (e.g. from 7x7 to 14x14), we can use the Deconvolutional Layer, which is an inverse to Convolutional.

This deConv layer/operation is quite popular in Segmentation architectures. You can read the papers about Fully Convolutional Network or U-Net for better intuition.

In case you have any further question, please read the original paper about Mask R-CNN and comparing it with the implementation code beforehand.

Best,

Nawaffarhan commented 2 years ago

My question the small object in coco where less than 32x32. And im mask rcnn the scale ration is 4 for 256 and roi align used convert the region to 14x14. My question is if I have small object (28x28) will be 28/4 is 7 how to roi align will converted to fixed size which is 14x14 and I read the code of maskrcnn don't used upsampling in this case.

On Mon, May 9, 2022, 11:59 AM Quang-Huy Nguyễn @.***> wrote:

Dear Nawaffahan,

For "downsampling" a feature matrix (e.g. from 512x512 to 32x32) we normally use *Convolutional layer with Convolutional Operation.

On the otherhand, for "upsampling" a featrue matrix (e.g. from 7x7 to 14x14), we can use the Deconvolutional Layer, which is an inverse to Convolutional.

This layer/operation is quite popular in Segmentation task. You can read the papers about Fully Convolutional Network or U-Net for better intuition.

— Reply to this email directly, view it on GitHub https://github.com/quanghuy0497/Mask_R-CNN/issues/2#issuecomment-1120833868, or unsubscribe https://github.com/notifications/unsubscribe-auth/AJOADSWE2IPYQMCSLM56VJLVJDHYXANCNFSM5UZNQIKA . You are receiving this because you authored the thread.Message ID: @.***>