Closed DaiZhewei closed 1 month ago
Sorry I do not get what 'TI' represents. But for all the training process, we always use the first 1/3 of the data.
Sorry I do not get what 'TI' represents. But for all the training process, we always use the first 1/3 of the data.
It refers to the Textual Inversion method that you used to generate different masks.
Sorry I do not get what 'TI' represents. But for all the training process, we always use the first 1/3 of the data.
It refers to the Textual Inversion method that you used to generate different masks.
If so, we only use the first 1/3 of the masks.
Thanks, I have one more question, the image of the training sample given in the paper is the 7th corresponding to the wood_color class (there are 8 samples in this class) and does not belong to the first 1/3 of the data mentioned
This experiment is not the final version, where we train the models by randomly selecting 1/3 of the data. Since it is only for qualitative comparison, we keep it to show the comparison results. Sorry for bringing such confusion to you.
Dear Hu Teng,
I observed there is red color on the generated images in MetalNuts, However there is not red color in the training set. Could you please clarify this issue?
Dear Hu Teng,
I observed there is red color on the generated images in MetalNuts, However there is not red color in the training set. Could you please clarify this issue?
For this extremely biased dataset, we use crop-paste to crop the red color from 012.png and paste it on 000.png with the mask of 000.png manually. Since this is not a common case, we do not mention it in the paper and code. And actually, the generated red color does not improve the anomaly detection accuracy obviously.
Dear Hu Teng,
Thank you so much for your kindness. For How many classes you did this? Could you please clarify? I observed similar things in carpet and wood.
Hi, thank you very much for your answer. I have three more questions:
Hi, thank you very much for your answer. I have three more questions:
- for object class defects, how to make sure that the randomly generated mask is in the corresponding position of the object, not in an unreasonable area or background area. I have read about mask filtering operation in other issues, and I would like to ask how to filter it, because the object area is different in each image.
- for the logic class defects (cable swap, metal_nut flip and transistor misplaced), is there any special treatment done and how is the reasoning process implemented.
- we downloaded the provided segmentation model for unet weights and found that unet handles the segmentation of logic class defects very well, would like to ask how this is achieved?
Dear Hu Teng,
Thank you so much for your kindness. For How many classes you did this? Could you please clarify? I observed similar things in carpet and wood.
I need to recount it carefully. But I'm quite so busy before ACM MM 2024, I will rely you once I have enough free time or after ACM MM. Thanks for your patience.
Thank you @sjtuplayer, for your kind response.
Hi author, I would like to ask one question, did you use all the masks in the test set when training the TI or only the first 1/3 of the masks? Looking forward to your answer, thanks!