-
> Please provide us with the following information:
> ---------------------------------------------------------------
### This issue is for a: (mark with an `x`)
```
- [ ] bug report -> please…
-
Huggingface Model: https://huggingface.co/microsoft/Phi-3.5-vision-instruct
Fine-tuned Dataset: https://huggingface.co/datasets/linxy/LaTeX_OCR
Usually, fine-tuning a multimodal large model invo…
-
## 🐛 Bug
## To Reproduce
Using this model [Phi-3-vision-128k-instruct](https://huggingface.co/microsoft/Phi-3-vision-128k-instruct)
I got some bugs, need your help !!!
For phi3-v problem, w…
-
Bravo for your work, there is potential in this project!
I would like to use this project to create a compatible openai api for the model: [phi-3-vision-128k-instruct](https://onnxruntime.ai/docs/g…
-
### The Feature
Support was added for all Nvidia NIM LLM models (integrate.api.nvidia.com addresses), but so far, I don't believe support was added for the VLM models (ai.api.nvidia.com addresses).
…
-
Does this project support the training and inference of multi-modal retrieval models, such as Phi-3-vision? I'd like to reproduce the experiments in paper https://arxiv.org/abs/2406.11251 based on thi…
-
- [x] MiniCPM-Llama3-V-2_5
- [x] Florence 2
- [x] Phi-3-vision
- [x] Bunny
- [x] Dolphi-vision-72b
- [x] Llava Next
- [ ] Idefics 3
- [ ] Llava Interleave
- [ ] Llava onevision
- [ ] internlm…
-
https://huggingface.co/microsoft/Phi-3-medium-128k-instruct
https://huggingface.co/microsoft/Phi-3-medium-4k-instruct
https://huggingface.co/microsoft/Phi-3-small-8k-instruct
https://huggingf…
-
### Describe the issue
When traying to run basic sample, form the Phi 3 Cookbook [https://github.com/microsoft/Phi-3CookBook/blob/main/md/07.Labs/Csharp/src/LabsPhi301/Program.cs](https://github.com/…
-
I can see that there are multiple issues of the form "add X as a new OCR engine":
- #17
- #18
- #19
- #36
... therefore would it be sensible to document the steps and / or rearchitect such that t…