-
### ⚠️ Search for existing issues first ⚠️
- [X] I have searched the existing issues, and there is no existing issue for my problem
### Which Operating System are you using?
Linux
### Which versio…
-
Is there a recommended way to run data parallel inference (i.e. a copy of the model on each GPU)? It's possible by hacking CUDA_VISIBLE_DEVICES, but I was wondering if there's a cleaner method.
```py…
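One common pattern (a sketch, not an official vLLM API — the `worker` body below stands in for real engine construction, which is shown only as a hypothetical comment) is to shard the prompts and launch one process per GPU, setting `CUDA_VISIBLE_DEVICES` in each child before any CUDA library gets imported:

```python
import multiprocessing as mp
import os

def worker(gpu_id, prompts, queue):
    # Pin this process to a single GPU. This must happen before any
    # CUDA-using library is imported in the child.
    os.environ["CUDA_VISIBLE_DEVICES"] = str(gpu_id)
    # Hypothetical real engine call (not executed in this sketch):
    #   from vllm import LLM
    #   llm = LLM(model="...")
    #   outputs = [o.outputs[0].text for o in llm.generate(prompts)]
    # Stand-in so the sketch runs without GPUs:
    outputs = [f"gpu{gpu_id}: {p}" for p in prompts]
    queue.put((gpu_id, outputs))

def data_parallel_generate(prompts, num_gpus):
    # Round-robin shard: shard i gets prompts i, i+num_gpus, i+2*num_gpus, ...
    shards = [prompts[i::num_gpus] for i in range(num_gpus)]
    # "fork" keeps the sketch self-contained on Linux; with a real CUDA
    # stack prefer mp.get_context("spawn") so children start with a clean
    # CUDA state.
    ctx = mp.get_context("fork")
    queue = ctx.Queue()
    procs = [ctx.Process(target=worker, args=(i, shards[i], queue))
             for i in range(num_gpus)]
    for p in procs:
        p.start()
    # Drain the queue before joining to avoid blocking on a full pipe.
    results = dict(queue.get() for _ in procs)
    for p in procs:
        p.join()
    return results
```

Each child only ever sees its own device as GPU 0, so no per-engine device arguments are needed; the trade-off is that results must be gathered and re-merged by shard index in the parent.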
-
### Your current environment
```text
The output of `python collect_env.py`
```
```
:128: RuntimeWarning: 'torch.utils.collect_env' found in sys.modules after import of package 'torch.utils', bu…
-
### 📚 The doc issue
![image](https://github.com/InternLM/lmdeploy/assets/62475359/749182ac-fb3f-43d0-bbae-09c219ac0c40)
As the title says: using the llava-1.5-13b model, running the documentation's [api_server performance test] directly raises an error.
![image](https://github.com/I…
-
### System Info
```text
Name           Version   Build    Channel
langchain      0.0.350   pypi_0   pypi
langchain-cli  0.0.19    …
```
-
I have encountered an issue when attempting to run the `vllm_inference.py` script from the Modal Examples repository. Below are the steps I followed and the error I encountered:
### Steps to Reprod…
-
First of all, thanks for this amazing package!
**Context:**
We're experimenting with running some rather unruly LLMs (i.e. they love repeating themselves in some cases). Due to the nature of our t…
-
https://github.com/denoland/deno_core/issues/898
/bounty 200
definition of done:
- does not crash anymore
-
I've run some experiments with vLLM and read through the docs, but have not been able to achieve higher performance.
I have a couple of questions.
1) Will using vllm on linux with a 4090 get fas…
jtoy updated 2 months ago
-
**What would you like to be added**:
Support setting different commands/liveness probes for the leader pod and the other pods within a group.
**Why is this needed**:
There are multiple LLM frameworks that s…