Open dreamerlin opened 1 month ago
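I ran this on 8 GPUs.
By the way, is there a problem with this line of code? https://github.com/InternLM/lmdeploy/blob/main/lmdeploy/vl/model/phi3_vision.py#L61 Shouldn't it be
temp_glb_GN = self.glb_GN.repeat(1, H // 2, 1, 1)
After modifying the code myself (I only changed the device-related code), I ran an 8k context with 2 images on a text needle task, and the output was wrong.
Are you sure the phi code logic is free of errors?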
@dreamerlin Hi, it seems that the implementation in lmdeploy is based on an older version of the Phi-3 model; see this commit: https://huggingface.co/microsoft/Phi-3-vision-128k-instruct/commit/866d1691437a49af79d5f3ad4a34c1750e08d163 . We may update it later. By the way, could you provide sample code and the image files needed to reproduce this? Thanks.
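On the `repeat(1, H // 2, 1, 1)` question raised above, a quick shape check may help when comparing the two versions. The following is a minimal sketch in plain Python that mirrors the shape rules of `torch.Tensor.repeat` and `torch.cat`. The values of `H` and `C`, the feature layout, and the separator shape are assumptions made for illustration; they are not taken from the lmdeploy or Hugging Face source.

```python
def repeat_shape(shape, repeats):
    """Shape produced by torch.Tensor.repeat(*repeats) on a tensor of `shape`."""
    if len(repeats) < len(shape):
        raise ValueError("repeats must have at least as many entries as the tensor has dims")
    # Missing leading dims are treated as size 1, as torch.Tensor.repeat does.
    padded = (1,) * (len(repeats) - len(shape)) + tuple(shape)
    return tuple(s * r for s, r in zip(padded, repeats))


def cat_shape(a, b, dim):
    """Shape produced by torch.cat([a, b], dim=dim); all other dims must match."""
    if len(a) != len(b) or any(x != y for i, (x, y) in enumerate(zip(a, b)) if i != dim):
        raise ValueError(f"cannot concatenate {a} and {b} along dim {dim}")
    return tuple(x + y if i == dim else x for i, (x, y) in enumerate(zip(a, b)))


H, C = 24, 1024  # hypothetical grid side and feature width, chosen for illustration
features = (1, H // 2, H // 2, 4 * C)  # assumed layout of the merged global image features
separator = (1, 1, 1, 4 * C)           # assumed shape of a learned separator embedding

sep_rows = repeat_shape(separator, (1, H // 2, 1, 1))
merged = cat_shape(features, sep_rows, dim=2)
print(sep_rows)  # (1, 12, 1, 4096)
print(merged)    # (1, 12, 13, 4096)
```

A shape check like this only shows that the concatenation is well-formed. It cannot tell which separator parameter is the intended one, so the line still needs to be compared against the upstream reference implementation.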
Checklist
Describe the bug
Reproduction
backend_config = PytorchEngineConfig(tp=8, session_len=session_len)
pipe = lmdeploy.pipeline(
    args.checkpoint,
    backend_config=backend_config,
    chat_template_config=ChatTemplateConfig(model_name='phi-3'))
Environment
Error traceback