Closed ys-zong closed 2 months ago
Hi, thanks for the nice work! I wonder if InternLM-4KHD supports interleaved image-text (e.g. multiple images) inputs for inference like InternLM-XComposer?
hi, the model has such capability but is not good at it, as we do not train the model with interleaved data
@ys-zong Hi, you may try to concatenate multiple images into a sinlge large image and ask the question.
Hi, thanks for the nice work! I wonder if InternLM-4KHD supports interleaved image-text (e.g. multiple images) inputs for inference like InternLM-XComposer?