Closed hmxiong closed 10 months ago
Thanks for your attention. For your questions,
Thank you very much for the guidance. I think the main reason may be the mismatch of LLaMA weights. Although I use the delta weight of V0, I will change the data set and try again to see if it can solve the problem of recurrence. At this stage when I used the official ckpt test, I found that the results would fluctuate to a certain extent.
I don’t know if you can provide a method for making Fintune data. The main purpose here is to ask how to get all the options when testing on the ScanQA data set (how to convert QA questions into multiple-choice questions) and how to convert ScanNet Detection bounding box information to text format
Sorry for late reply. We can upload finetuning data after security checking. Here is how we build the finetuning data from existing dataset.
For ScanQA, we used Chat GPT API and ask it to generate some related and confusing options based on the given ground truth, and then combine all the options as all options of the question. For ScanNet boudning boxes, we represent the bounding box with 6 numbers, which is x,y,z coordinates of center and lengther of edges. And we prompted ChatGPT to generate some templates that can link class label and bounding boxes into a sentence, which is similar for cases of 2D detection.
Hope this can solve your problems.
I don’t know if you can provide a method for making Fintune data. The main purpose here is to ask how to get all the options when testing on the ScanQA data set (how to convert QA questions into multiple-choice questions) and how to convert ScanNet Detection bounding box information to text format
For your reference, finetuning data for ScanQA multiple choice is available on huggingface.
Sorry to bother you again. At present, I have used the official code to complete the benchmark test experiment. Here are my questions: