-
### 🚀 The feature, motivation and pitch
There is huge potential in more advanced load-balancing strategies tailored to the unique characteristics of AI inference, compared to basic strategies such …
-
### Your current environment
```text
vllm 0.5.4
```
### 🐛 Describe the bug
1. Start the vLLM server: `python -m vllm.entrypoints.openai.api_server --served-model-name qwen2 --model /ai-deploy/open-m…`
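For context, a request against the server started above would typically look like the sketch below. This is an assumption, not the reporter's truncated repro step: port 8000 is vLLM's default, and `model` must match `--served-model-name`.

```python
# Minimal sketch (assumed, not the reporter's exact request): query the
# OpenAI-compatible endpoint. Port 8000 is vLLM's default; adjust if the
# server was started with --port.
from openai import OpenAI

client = OpenAI(base_url="http://localhost:8000/v1", api_key="EMPTY")
resp = client.chat.completions.create(
    model="qwen2",  # must match --served-model-name above
    messages=[{"role": "user", "content": "Hello"}],
)
print(resp.choices[0].message.content)
```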
-
### Your current environment
```text
podman --version
podman version 5.2.3
uname -a
Linux noelo-work 6.10.12-200.fc40.x86_64 #1 SMP PREEMPT_DYNAMIC Mon Sep 30 21:38:25 UTC 2024 x86_64 GNU/L…
```
-
# ComfyUI Error Report
## Error Details
- **Node Type:** QuickMfluxNode
- **Exception Type:** RuntimeError
- **Exception Message:** [load_safetensors] Failed to open file /Users/wengyinghui/Comf…
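A quick way to narrow down a `[load_safetensors] Failed to open file` error is to verify the path and the file header outside ComfyUI. The path below is a hypothetical stand-in, since the real one is truncated in the report:

```python
# Diagnostic sketch: confirm the checkpoint exists and its safetensors
# header parses. Replace the placeholder with the actual (truncated) path.
from pathlib import Path
from safetensors import safe_open

path = Path("/Users/wengyinghui/ComfyUI/models/your_model.safetensors")  # hypothetical
if not path.is_file():
    print("file missing or not readable:", path)
else:
    with safe_open(str(path), framework="pt") as f:
        print("loadable; first tensors:", list(f.keys())[:5])
```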
-
Hi there! 🤗
`FlashLlamaForCausalLM` uses the name `dense` for its MLP submodule, and when a user wants to employ a LoRA adapter, `get_mlp_weights` skips this submodule.
https://github.com/huggingface/t…
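To illustrate the failure mode (the names and logic below are assumptions for illustration, not TGI's actual code): if the helper collects MLP weights by matching only the usual Llama projection names, a submodule called `dense` never matches and silently drops out of the LoRA targets.

```python
# Hypothetical sketch of the described bug, not TGI's implementation:
# name-based collection that misses an MLP submodule called "dense".
EXPECTED_MLP_NAMES = {"gate_proj", "up_proj", "down_proj"}

def collect_mlp_weights(model):
    """Pick MLP submodules by leaf name; 'dense' falls through the filter."""
    picked = {}
    for name, module in model.named_modules():
        leaf = name.rsplit(".", 1)[-1]
        if leaf in EXPECTED_MLP_NAMES:  # a "dense" submodule never matches
            picked[name] = module
    return picked
```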
-
I compared two ways to launch the server.
The model is vicuna-7b, and the GPUs are 2 × A30.
The first way is:
```
python -m vllm.entrypoints.openai.api_server \
    --model /data/models/vicuna-…
```
-
### 📚 The doc issue
After starting the service with `lmdeploy serve api_server THUDM/chatglm2-6b --adapters mylora=chenchi/lora-chatglm2-6b-guodegang`, can calls use both the bare model and the LoRA-trained one?
For example, when calling OpenAI-style, does `model_name=mylora` invoke the adapte…
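If the server registers the base model and the adapter under separate names, one deployment can serve either per request. A sketch, assuming lmdeploy's default port 23333 and that `GET /v1/models` lists both names (both assumptions worth verifying):

```python
# Sketch: call the same OpenAI-compatible server with the bare model or
# the LoRA adapter by switching the model name. Names and port are assumptions.
from openai import OpenAI

client = OpenAI(base_url="http://localhost:23333/v1", api_key="EMPTY")
for model_name in ("chatglm2-6b", "mylora"):  # base model vs. LoRA adapter
    resp = client.chat.completions.create(
        model=model_name,
        messages=[{"role": "user", "content": "Hello"}],
    )
    print(model_name, "->", resp.choices[0].message.content)
```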
-
Hi,
I installed version 2024.11.16 on a Heltec LoRa 32 V2 board, and the LCD is not working although it is configured:
`"display": { "alwaysOn": true, "timeout": 5, "turn180": false },`
![2024-1…
-
As stated in your paper, the server distributes the stacked global LoRA module to each client, but how does each client convert this global module into a local module with a lower rank?
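The thread doesn't show the paper's conversion step, but one common way to project a higher-rank global LoRA update down to a lower local rank is a truncated SVD of the effective update matrix. The sketch below (shapes and rank values are assumed) shows that generic approach, not necessarily the paper's method.

```python
# Generic rank-reduction sketch, not necessarily the paper's method:
# compress a stacked LoRA update B @ A to a lower local rank via SVD.
import torch

def truncate_lora(B: torch.Tensor, A: torch.Tensor, r: int):
    """Compress a LoRA update B @ A (d_out x R @ R x d_in) to rank r <= R."""
    delta = B @ A                      # effective weight update, d_out x d_in
    U, S, Vh = torch.linalg.svd(delta, full_matrices=False)
    B_r = U[:, :r] * S[:r]             # d_out x r, singular values folded in
    A_r = Vh[:r, :]                    # r x d_in
    return B_r, A_r

B = torch.randn(4096, 64)              # stacked global module, assumed rank 64
A = torch.randn(64, 4096)
B_8, A_8 = truncate_lora(B, A, r=8)    # local module with lower rank 8
```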
-
### Issue Description
xyz search-and-replace with only x activated works fine, i.e. it applies the LoRA. The prompt is:
```
photo of man on the street
comic book style ,
```
…