Closed ghost closed 2 years ago
Hi, They use two different methods for localization. GradCam and Patch Heatmap. The GradCam method does not require retraining and achieves 88.3 AUC. The Patch Heatmap scores 96.0 and requires training on 64×64 patches as described in Section 2.4:
[...] we learn a representation of an image patch us- ing CutPaste prediction, as in Section 2.4. We train mod- els of 64×64 patches from 256×256 image.
Does CutPaste need to train two models for defect detection and localization?
Depends on the localization method you want to use. With GradCam no. With the patch based method yes.
Another repo also implements these localization and might be interesting to you: https://github.com/LilitYolyan/CutPaste
Hope this helps your understanding
Thank you for your prompt reply and detailed explanation! LilitYolyan they implemented the localization part in the dataset.py, thanks for sharing!
According to the paper in section 4.2, it said
and
Also in section 2.4,
And the training objectives defined in formulas (1) and (3) are different.
Does CutPaste need to train two models for defect detection and localization?
It seems that they trained an image-level detector (3-way), the detection result achieves 95.2 while localization result only achieves 88.3. And they need to train another model using patches as input to get localization result of 96.0. Please correct me if I was wrong.