-
### Is there an existing issue / discussion for this?
- [X] I have searched the existing issues / discussions
### Is this question answered in the FAQ?
-
Using `flashinfer` in `sglang` with `google/gemma-7b-it` fails with:
```text
File "/home/ubuntu/sglang-venv/lib/python3.11/site-packages/flashinfer/prefill.py", line 462, in forward
    return self._wrapper.…
```
-
### Your current environment
```text
PyTorch version: 2.3.1
Is debug build: False
CUDA used to build PyTorch: None
ROCM used to build PyTorch: N/A
OS: macOS 14.1.1 (arm64)
GCC version: Could not colle…
```
-
We will implement this based on [the llama.cpp grammars README](https://github.com/ggerganov/llama.cpp/blob/master/grammars/README.md).
The idea is as follows, given a parsed BNF grammar (see the sketch after these steps):
0) While the model is calculating the logits, …
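For illustration, here is a minimal sketch of the logit-masking step. It assumes a hypothetical helper that derives the set of grammar-legal next-token ids from the parsed BNF state; the grammar walker itself is not shown:
```python
import torch

def apply_grammar_mask(logits: torch.Tensor, allowed_token_ids: list[int]) -> torch.Tensor:
    """Keep only grammar-legal tokens by masking everything else to -inf."""
    mask = torch.full_like(logits, float("-inf"))
    mask[allowed_token_ids] = 0.0
    return logits + mask

# Sketch of one decoding step:
# logits = model(input_ids)[-1]            # [vocab_size] logits for the next token
# ids = grammar_state.allowed_token_ids()  # hypothetical: walk the parsed BNF
# next_token = torch.argmax(apply_grammar_mask(logits, ids))
```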
-
### System Info
```text
(Xinference) root@Server-058266b4-896f-4df5-9763-65d3e241d655:~# pip list
Package                           Version
--------------------------------- ------------
accelerate…
```
-
### Feature request
Add support for more multi-modal models going forward. LLaVA 1.6 is one option, but waiting for whichever strong model comes out next (IDEFICS 2?) would also be fine.
### Motivation
…
-
sglang installs the latest vllm, and it looks like this module was removed in vllm 0.4.0:
`ModuleNotFoundError: No module named 'vllm.model_executor.input_metadata'`
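A quick way to confirm what this report describes, checking the installed vllm version and whether the module still resolves:
```python
import importlib.util
import vllm

print(vllm.__version__)
# find_spec returns None when the module no longer exists,
# which this report says is the case on vllm >= 0.4.0.
print(importlib.util.find_spec("vllm.model_executor.input_metadata"))
```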
-
I see the option for `sgl.set_default_backend()`, but this seems to be a global setting. Is there a way to have multiple backends running and pick which one is used per call?
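To illustrate the question, something like the following is what is being asked for; the per-call `backend=` keyword to `run()` is an assumption here, not a documented API:
```python
import sglang as sgl

# Two independent backends pointing at different servers.
backend_a = sgl.RuntimeEndpoint("http://localhost:30000")
backend_b = sgl.RuntimeEndpoint("http://localhost:30001")

@sgl.function
def qa(s, question):
    s += sgl.user(question)
    s += sgl.assistant(sgl.gen("answer", max_tokens=64))

sgl.set_default_backend(backend_a)  # the global setting mentioned above

# Desired: pick a backend per call instead of globally.
# (Assumption: run() accepts a backend override.)
state = qa.run(question="Hello?", backend=backend_b)
```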
-
### Motivation
Prefix caching is supported in many projects, such as vllm, sglang, and rtp-llm. The torch engine is going to support this feature in https://github.com/InternLM/lmdeploy/pull/1393. So we ra…
-
I'm running the runtime directly, like so:
```python
# Imports assumed for this snippet; handle_port_init lived in sglang.srt.utils at the time.
import sglang as sgl
from sglang.srt.utils import handle_port_init

SGLANG_PORT, additional_ports = handle_port_init(30000, None, 1)
RUNTIME = sgl.Runtime(
    model_path=model_path,
    port=SGLANG_PORT,
    addi…
```
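For completeness, here is how the runtime would then be used, following the `sgl.set_default_backend(runtime)` pattern from the sglang README (a sketch continuing the snippet above, not the full script):
```python
# Use the in-process runtime as the default backend.
sgl.set_default_backend(RUNTIME)

@sgl.function
def hello(s):
    s += "Q: What is sglang?\nA:"
    s += sgl.gen("answer", max_tokens=32)

state = hello.run()
print(state["answer"])

RUNTIME.shutdown()  # stop the server process when done
```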