-
### Your current environment
```text
The output of `python collect_env.py`
```
### 🐛 Describe the bug
Machine: A800, vLLM 0.5.0, prompt = 开始 ("start"), max output tokens = 2048, temperature set to 0.7.
vLLM…
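For reference, the settings described above correspond to the following request payload against vLLM's OpenAI-compatible server; a minimal sketch, where the model name is an illustrative placeholder:

```python
import json

# Request payload matching the reported settings: prompt "开始",
# max_tokens 2048, temperature 0.7. The model name is a placeholder
# for whatever checkpoint is actually served on the A800.
payload = {
    "model": "Qwen/Qwen2-7B-Instruct",
    "prompt": "开始",
    "max_tokens": 2048,
    "temperature": 0.7,
}

body = json.dumps(payload, ensure_ascii=False)
print(body)
```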
-
I tried to apply the Triton patch like this:
`pip3 install --extra-index-url https://sasha0552.github.io/vllm-ci/ --force-reinstall triton`
which outputs:
```
pip3 install --extra-index-url https…
```
-
As the title says, I have set up two deployment environments:
1. vLLM + Qwen2
2. Ollama + Qwen2
When calling Qwen through Spring AI, can function calling be used with either of these two deployment setups?
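Over the OpenAI-compatible endpoint, function calling amounts to sending a `tools` array with the chat request; a minimal sketch of such a payload (the model name and the `get_weather` tool schema are illustrative placeholders — whether the backend actually emits tool calls depends on the server's tool-call support):

```python
import json

# An OpenAI-style chat request carrying a function ("tool") definition.
# The model name and get_weather tool are illustrative; the serving
# backend must support tool calls for the model to use them.
request = {
    "model": "Qwen/Qwen2-7B-Instruct",
    "messages": [
        {"role": "user", "content": "What is the weather in Beijing?"}
    ],
    "tools": [
        {
            "type": "function",
            "function": {
                "name": "get_weather",
                "description": "Get the current weather for a city",
                "parameters": {
                    "type": "object",
                    "properties": {"city": {"type": "string"}},
                    "required": ["city"],
                },
            },
        }
    ],
}

print(json.dumps(request, indent=2))
```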
-
### The model to consider.
Mamba Codestral: https://huggingface.co/mistralai/mamba-codestral-7B-v0.1
Highlights:
- SOTA 7B code model
- theoretically unlimited context length; tested up to 256k
…
-
### Your current environment
The output of `python collect_env.py`
```text
Your output of `python collect_env.py` here
```
### 🐛 Describe the bug
Hello,
On a container env I …
-
I defined a dataset in llmuses/benchmarks following the required format; how can I use it to evaluate a model deployed with vLLM? In Native mode I cannot find where to pass the model's address, and in OpenCompass mode custom datasets are not supported.
-
### Your current environment
4xH100.
### Model Input Dumps
_No response_
### 🐛 Describe the bug
When benchmarking the performance of vLLM with `benchmark_serving.py`, it will generate different…
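Run-to-run variation is expected whenever sampling uses a temperature above zero without a fixed random seed; the effect can be illustrated in plain Python (the vocabulary below is a toy stand-in for real token sampling):

```python
import random

def sample_tokens(seed=None, n=5):
    # Toy stand-in for temperature-based sampling: draws n "tokens"
    # from a fixed vocabulary. With no seed, each run can differ.
    rng = random.Random(seed)
    vocab = ["the", "a", "model", "vllm", "fast"]
    return [rng.choice(vocab) for _ in range(n)]

# Fixing the seed makes the draw reproducible across runs.
assert sample_tokens(seed=0) == sample_tokens(seed=0)
print(sample_tokens(seed=0))
```

The same principle applies to the benchmark: pinning the sampling seed (where the harness exposes one) trades realism for reproducibility.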
-
While testing the `--load-in-low-bit` feature with the vLLM-for-CPU example, I noticed the model is not optimized according to this option.
I found that it needs to pass in the load_in_low_bit ar…
-
### Your current environment
Problem
### 🐛 Describe the bug
```python
from vllm import LLM, SamplingParams
from transformers import AutoTokenizer
import torch
# Initialize the tokenizer
tokeniz…
```
-
### Issue Description
While using pasta for container networking on a machine with no internet connection, `podman run` always fails.
Alternatively, using `--network=slirp4netns` or `--network=none` works…