-
I am running the [phi3 vision directml](https://huggingface.co/microsoft/Phi-3-vision-128k-instruct-onnx-directml/tree/main/directml-int4-rtn-block-32) tutorial [code](https://onnxruntime.ai/docs/gena…
-
### Motivation
The latest release of the Microsoft Phi-3 vision model (4.2B parameters, 128k context) looks promising in performance, and it is resource-efficient too, since it boasts just 4.2B parameters. So it would be a great f…
-
### This issue is for a: (mark with an `x`)
```
- [ ] bug report -> please search issues before submitting
- [x] feature request
- [ ] documentation issue or request
- [ ] regression (a behavior …
```
-
## 🐛 Bug
## To Reproduce
Using this model [Phi-3-vision-128k-instruct](https://huggingface.co/microsoft/Phi-3-vision-128k-instruct)
I ran into some bugs and need your help.
For phi3-v problem, w…
-
The max new tokens value differs across the LMMs in the project.
I believe we should have a consistent MAX_NEW_TOKENS across the project, set to 512 or 1024.
If it makes sense, I can create a PR to modify al…
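
To make the proposal concrete, here is a rough sketch of what a shared default could look like. Everything in it is an assumption for illustration: the `generation_kwargs` helper is hypothetical, the value 512 is just one of the two candidates mentioned above, and the `max_new_tokens` key simply mirrors the keyword used by Hugging Face `generate()`. It is not how the project currently does it.

```python
# Hypothetical shared default for every model wrapper in the project.
# The issue suggests 512 or 1024; 512 is picked here only for illustration.
MAX_NEW_TOKENS = 512


def generation_kwargs(**overrides):
    """Build generation settings that start from the shared default.

    Any keyword passed in `overrides` replaces the default, so a model
    that genuinely needs a different limit can still set one explicitly.
    """
    kwargs = {"max_new_tokens": MAX_NEW_TOKENS}
    kwargs.update(overrides)
    return kwargs


if __name__ == "__main__":
    # Default settings shared by all models.
    print(generation_kwargs())
    # Explicit per-call override.
    print(generation_kwargs(max_new_tokens=1024))
```

Each model wrapper could then pass `**generation_kwargs()` to its generate call instead of hard-coding its own limit, which keeps the override path open for models that need it.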
-
### The model to consider.
from typing import List
from fastapi import FastAPI, HTTPException
from pydantic import BaseModel
from PIL import Image
from vllm import LLM, SamplingParams
import os
…
-
### Checklist
- [X] 1. I have searched related issues but cannot get the expected help.
- [X] 2. The bug has not been fixed in the latest version.
- [X] 3. Please note that if the bug-related issue y…
-
While fine-tuning the model, I ran into this problem. I downloaded the segmentation dataset from Hugging Face and unzipped it. It seems that the segmentation dataset needs further processing. What…
-
Can this work for multiple images?
-
https://huggingface.co/Qwen/Qwen-VL-Chat/tree/main
https://huggingface.co/deepseek-ai/deepseek-vl-7b-chat
I've gotten extremely good results from these, so it would be great to have them baseline in…