-
### Your current environment
I installed it using Docker Swarm on a dedicated cloud VPS on Hetzner. I want to run a lightweight model, "jinaai/jina-embeddings-v3", and I assume the CPU and RAM are sufficient in …
-
### Describe the bug
I am building a custom chatbot component using the multimodal chatbot and the latest @gradio npm install, and the `normalise_file` function that is being used doesn't exist.
### Have you searched…
-
### Describe the bug
![image](https://github.com/user-attachments/assets/a8d4a6ee-db42-4892-9eba-db08b8418601)
Calculation formulas
Resolution ( r ):
[
r = \frac{1}{\bar{U}} \times 100\%
]
Nonlinearity ( \delta_1 ):
[
\delt…
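As a hedged illustration of the resolution formula above (the nonlinearity formula is truncated, so only r is shown), here is a minimal Python sketch, assuming \bar{U} is the mean measured value; the function name is illustrative, not from the issue:

```python
def resolution_percent(mean_value: float) -> float:
    """Resolution r = (1 / U_bar) * 100%, with U_bar the mean measured value."""
    return (1.0 / mean_value) * 100.0

# Example: a mean reading of 5.0 gives a resolution of 20.0 (%).
print(resolution_percent(5.0))
```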
-
### Your current environment
I am running vllm serve with a multimodal model (Phi3.5K). How do I run benchmark_serving.py to test the multimodal path?
In the benchmark_serving.py file I see the following, but test_mm…
-
### 🚀 The feature, motivation and pitch
I'm working on applications that must run locally on resource-limited HW. Therefore, quantization becomes essential. Such applications need from multimodal vi…
-
### Your current environment
[pip3] numpy==1.25.1
[pip3] nvidia-cublas-cu12==12.4.5.8
[pip3] nvidia-cuda-cupti-cu12==12.4.127
[pip3] nvidia-cuda-nvrtc-cu12==12.4.127
[pip3] nvidia-cuda-runtime-…
-
### 🚀 The feature, motivation and pitch
- a multimodal feature to benchmark offline latency, throughput, and online serving for Pixtral
### Alternatives
- everyone writes thei…
-
### Anything you want to discuss about vllm.
In Qwen2-VL's M-RoPE implementation, vLLM decides whether the input positions are multimodal with
![image](https://github.com/user-attachments/assets/6dfc96d9-5162-…
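The screenshot above is not quoted in text, but the kind of check being discussed can be sketched as follows. This is a hypothetical, simplified version (the name and plain-list representation are illustrative, not vLLM's actual API), assuming M-RoPE packs multimodal position ids as a 2-D (3, seq_len) array while ordinary text positions stay 1-D:

```python
def is_mrope_positions(positions) -> bool:
    # M-RoPE carries three position streams (temporal, height, width), so
    # multimodal position ids arrive as three rows of per-token positions;
    # plain text positions are a flat 1-D sequence of ints.
    return len(positions) == 3 and all(isinstance(row, list) for row in positions)

text_positions = list(range(8))                    # shape (8,)
mm_positions = [list(range(8)) for _ in range(3)]  # shape (3, 8)
print(is_mrope_positions(text_positions))  # False
print(is_mrope_positions(mm_positions))    # True
```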
-
I think Intel CPUs/GPUs now support more efficient inference with OpenVINO. See the example with LLaVA here: https://docs.openvino.ai/2023.2/notebooks/257-llava-multimodal-chatbot-with-output.html
It …
-
### Your current environment
vllm == 0.5.5.
### 🐛 Describe the bug
When we deploy `microsoft/Phi-3.5-vision-instruct`,
it will randomly hit this issue.
```
(VllmWorkerProcess p…