shikras / shikra

Other
710 stars 44 forks source link

How is the toy shikra trained in Table 2? #20

Closed qumengxue closed 11 months ago

qumengxue commented 11 months ago

I find this project extremely interesting and I'm eager to follow its progress. I have a question regarding the training process mentioned in the paper. The paper refers to "toy shikra/toy model" many times. I'm curious to know how the toy shikra was trained, particularly the results mentioned in Table 2. Was it trained only with REC datasets and initialized from the llama model?

kq-chen commented 11 months ago

it is trained on a combination of three REC datasets and initialized from the llava-7b model

qumengxue commented 11 months ago

Got it, thanks for your reply!