Closed Quest2GM closed 1 month ago
We only use the caption (i.e., the "from": "gpt" turn, for example "The image captures a scene in a desert-like setting ...") in training. If you also want to use the questions, you may ask the author of ShareGPT4V for further details.
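To make the point about captions concrete, here is a minimal sketch of pulling out only the "gpt" turns from records shaped like the snippet quoted in this thread. The "from" and "value" keys come from the thread; the "image" and "conversations" keys, the sample record, and the `extract_captions` helper are assumptions for illustration, not the actual training code.

```python
def extract_captions(records):
    """Return (image, caption) pairs, keeping only the "gpt" turns.

    Assumes each record has an "image" key and a "conversations" list
    of {"from": ..., "value": ...} turns (hypothetical layout).
    """
    pairs = []
    for record in records:
        for turn in record.get("conversations", []):
            # The "human" turn (question plus <image> tag) is skipped.
            if turn.get("from") == "gpt":
                pairs.append((record.get("image"), turn["value"]))
    return pairs


# Hypothetical sample record, modeled on the lines quoted in this thread.
sample = [
    {
        "image": "example.jpg",
        "conversations": [
            {
                "from": "human",
                "value": "<image>\nCan you describe the main features of this image for me?",
            },
            {
                "from": "gpt",
                "value": "The image captures a scene in a desert-like setting ...",
            },
        ],
    }
]

captions = extract_captions(sample)
```

With this layout, the position of the tag inside the "human" turn has no effect on what gets extracted, since that turn is never read.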
I was referring to the image tag in this line here:
"value": " <image>\nCan you describe the main features of this image for me?"
Why is the <image> tag used here? Sometimes it is placed before the question and sometimes after it.
Oh sorry, I misunderstood what you said. I see now that you don't use the question. Thanks for the quick response.
Hi
I am looking to create my own dataset to fine-tune this model. In the share-captioner_coco_lcs_sam_1246k_1107.json file, I noticed "image" tags at the beginning or at the end of the question. Are these image tags important? If so, how do you decide whether the tag should be placed before or after the prompt?
Like this:
Thanks for the great work, and would appreciate a quick response.
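For anyone building their own dataset who does want to keep the questions, one option is to make the tag placement consistent before training. The sketch below moves the tag to the front of the prompt. It is only an illustration: the `normalize_image_tag` helper is hypothetical, and whether placement matters for this model is not established in this thread.

```python
IMAGE_TAG = "<image>"


def normalize_image_tag(prompt, tag=IMAGE_TAG):
    """Move the image tag to the front of the prompt, followed by a newline.

    Prompts without the tag are returned unchanged. Placing the tag first
    is an assumed convention for illustration, not a confirmed requirement.
    """
    if tag not in prompt:
        return prompt
    # Drop the tag wherever it appears, then trim leftover whitespace.
    text = prompt.replace(tag, "").strip()
    return f"{tag}\n{text}"


before = normalize_image_tag(" <image>\nCan you describe the main features of this image for me?")
after = normalize_image_tag("Can you describe the main features of this image for me?\n<image>")
```

Both calls produce the same string, so prompts that differ only in where the tag sits end up identical.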