Closed watertianyi closed 3 weeks ago
@shirowalker Because a test object needs to be photographed six times with different light sources to show defects, which is equivalent to collecting six images. Do you have any better way to do this? Because different defects appear in different forms under different light sources, and the reaction Data diversity
@watertianyi It sounds like images under different lighting conditions can be separately analyzed, and the results of six detections can be unified to make a judgment. If calculating AUC-ROC, the maximum value of the six anomaly maps can be taken. If it is binary classification, an AND(&) operation can be performed (if one of the images is abnormal, the object is judged to be abnormal).
@shirowalker Is it more time-consuming to reason with six pictures than with a single picture?
@watertianyi Yes, it seems that we can only process the six images separately. Even if we treat them as multiple channels and process them together, the computational load won't be reduced.
This code is difficult to handle data other than RGB images. Mask labeling needs to be processed on RGB images using SAM, and the other pre-trained models in the code are also unable to handle 6-channel data.