vllm Search Results - Githubissues

1000+ results
for vllm

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

vllm-project/vllm #8158

[Bug]: vllm async engine can not use adag

### Your current environment PyTorch version: 2.4.0+cu121 Is debug build: False CUDA used to build PyTorch: 12.1 ROCM used to build PyTorch: N/A OS: Ubuntu 22.04.2 LTS (x86_64) GCC version: (U…

Bye-legumes updated 3 days ago
1
langchain-ai/langchain #23814

BadRequestError with vllm locally hosted Llama3 70B Model

### Checked other resources - [X] I added a very descriptive title to this issue. - [X] I searched the [LangGraph](https://langchain-ai.github.io/langgraph/)/LangChain documentation with the integrat…

Haxeebraja updated 5 days ago
3
vllm-project/vllm #8139

[New Model]: Qwen2-VL

### The model to consider. https://huggingface.co/Qwen/Qwen2-VL-7B-Instruct ### The closest model vllm already supports. https://github.com/vllm-project/vllm/blob/main/vllm/model_executor/models/qw…

krevas updated 3 days ago
3
Lightning-AI/pytorch-lightning #19829

How to incorporate vLLM in Lightning for LLM inference?

### Description & Motivation [vLLM](https://github.com/vllm-project/vllm) is one of the most popular and effective tool for quick, large-scale LLM inference. Are there any existing examples of incorp…

YuWang916 updated 4 days ago
3
QwenLM/Qwen2-VL #113

Qwen2-VL agent 时候支持视频作为输入

``` llm_cfg = { # Use the model service provided by DashScope: 'model': 'qwen-vl-max-0809', #'api_key': 'YOUR_DASHSCOPE_API_KEY', # It will use the `DASHSCOPE_API_KEY' environment…

Cherryjingyao updated 2 days ago
4
vllm-project/vllm #8212

[Bug]: FastAPI 0.113.0 breaks vLLM OpenAPI

### Your current environment The output of `python collect_env.py` ```text Collecting environment information... WARNING 09-05 21:11:49 cuda.py:22] You are using a deprecated `pynvml` package.…

drikster80 updated 18 hours ago
2
triton-inference-server/server #6583

Support for vLLM and TRT-LLM running in OpenAI compatible mo…

**Is your feature request related to a problem? Please describe.** I'd like to be able to run vLLM emulating the OpenAI compatible API to use vLLM as a drop-in replacement of ChatGPT. **Describe…

vecorro updated 3 days ago
16
junhwi/next-gen-ai #39

24/08/25

PyTorch is dead. Long live JAX. https://neel04.github.io/my-website/blog/pytorch_rant/ LLM Compressor https://github.com/vllm-project/llm-compressor https://neuralmagic.com/blog/llm-compressor-i…

shylee2021 updated 1 week ago
2
EricLBuehler/candle-vllm #62

Using candle-vllm as crate in rust?

Hi Eric, great rust programm. I am looking for a crate so I can use a chatbot function within my rust programm. I tried to to that with candle. I hope it will be more documented in den future. …

gkvoelkl updated 1 month ago
1
microsoft/MInference #43

[Question]: Why is running MInference/examples/run_vllm.py n…

### Describe the issue ```python from vllm import LLM, SamplingParams from minference import MInference prompts = [ "Hello, my name is", "The president of the United States is", …

zjjznw123 updated 1 month ago
1

上一页 1...12 13 14 15 16 17 18...100 下一页

1000+ results for vllm

1000+ results
for vllm