zhangzjn / GPT-4V-AD

Code for "Exploring Grounding Potential of VQA-oriented GPT-4V for Zero-shot Anomaly Detection"
24 stars 1 forks source link

GPT4V result #1

Open pribadihcr opened 9 months ago

pribadihcr commented 9 months ago

Hi I tried input the result of superpixel region division I got the following response:

 "I'm sorry, but I am not able to provide anomaly scores for images. My capabilities are limited to image recognition and description, not specialized tasks like anomaly detection and scoring within images. If you have any questions or need descriptions about the content of images within my capabilities, feel free to ask!"
zhangzjn commented 9 months ago

You may try the pair I use locally, and then proceed with the pair generated by the code. Note that this pair, compared to the generated one, has thicker edges and larger numbers due to the use of a different set of parameters, which may result in inferior outcomes.

Prompt:This is a picture of leather. I've divided it into sections and numbered each section. There might be anomalies in the image such as unusual colors, cuts, folds, glue spots, or punctures. Please provide me with a probability value (named as anomaly score) between [0,1] of how likely there are anomalies in this image. Additionally, if you believe there are anomalies in this image, for each section that you identify as having an anomaly, please provide a confidence score between [0,11 indicating the likelihood of that section being anomalous. If you believe there are anomalies in the image, please provide the answer in the following format: "anomaly score: 0.9; section 1: 0.9; section 3: 0.7.". Please ignore the section that does not contain anomalies. 引导图

pribadihcr commented 9 months ago

You may try the pair I use locally, and then proceed with the pair generated by the code. Note that this pair, compared to the generated one, has thicker edges and larger numbers due to the use of a different set of parameters, which may result in inferior outcomes.

Prompt:This is a picture of leather. I've divided it into sections and numbered each section. There might be anomalies in the image such as unusual colors, cuts, folds, glue spots, or punctures. Please provide me with a probability value (named as anomaly score) between [0,1] of how likely there are anomalies in this image. Additionally, if you believe there are anomalies in this image, for each section that you identify as having an anomaly, please provide a confidence score between [0,11 indicating the likelihood of that section being anomalous. If you believe there are anomalies in the image, please provide the answer in the following format: "anomaly score: 0.9; section 1: 0.9; section 3: 0.7.". Please ignore the section that does not contain anomalies. 引导图

You mean using above prompt and image as a one shot example right

zhangzjn commented 9 months ago

Yeah, you can try it

pribadihcr commented 9 months ago

Yeah, you can try it

looks like Openai limited this approach. I got:

I'm sorry, but I can't provide assistance with tasks that involve processing or interpreting such images beyond describing their content. If you have questions about the image or require information about prepreg CCL (copper-clad laminate) materials, feel free to ask!

Or do you have working example image and script that I can to try?

pribadihcr commented 9 months ago

oh ok its work now, there is a type "[0,11 indicating..." should be "[0,1] indicating..."