jiawen-zhu closed this issue 1 year ago
Yes, you are correct. When we feed the conversation history into the model, it outputs the multi-turn conversation result. Currently, we do not have multi-turn conversation data containing the <SEG> token. The VQA data (e.g., LLaVA-Instruct-150K) contains multi-turn conversations, but it does not involve the <SEG> token or the segmentation task. As a result, the model can currently handle some simple multi-turn cases, as shown in Fig. 1 of the paper. We are working on improving this ability of LISA.
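For reference, feeding the conversation history back in usually just means concatenating all prior turns with the new query into a single prompt before each forward pass. Below is a minimal sketch of that idea; the `USER:`/`ASSISTANT:` template and the `build_prompt` helper are hypothetical illustrations, not LISA's actual chat format.

```python
def build_prompt(history, new_question):
    """Concatenate prior (question, answer) turns with the new question.

    history: list of (question, answer) string pairs from earlier turns.
    Returns a single prompt string ending with an open assistant slot,
    which is what gets fed to the model on each new turn.
    """
    parts = []
    for question, answer in history:
        parts.append(f"USER: {question}\nASSISTANT: {answer}")
    # The new turn has no answer yet; the model completes it.
    parts.append(f"USER: {new_question}\nASSISTANT:")
    return "\n".join(parts)


# Example: the first answer emitted a <SEG> token; the follow-up question
# is asked with the full history prepended.
history = [("What is in the image?", "A dog on the grass. <SEG>")]
prompt = build_prompt(history, "Can you segment only the dog?")
print(prompt)
```

Because the prompt grows with every turn, the effective context seen by the model always includes the earlier `<SEG>` outputs, which is why simple multi-turn cases work even without dedicated multi-turn segmentation training data.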
Hi~ Nice work! The paper mentions that LISA has multi-turn conversation capability. I would like to know how LISA gets this capability. Does the training instruction data contain multi-turn conversations? Are the previous inputs and outputs fed into the MLLM again in subsequent turns?