-
### Related area
Heltec LoRa ESP32 V2
### Hardware specification
esp32-s3
### Is your feature request related to a problem?
Hello,
I am experiencing difficulties connecting my Heltec LoRa ESP…
-
### Prerequisites
- [X] I am running the latest code. Mention the version if possible as well.
- [X] I carefully followed the [README.md](https://github.com/ggerganov/llama.cpp/blob/master/README.md)…
-
### System Info
GPU Name: NVIDIA A800
TensorRT-LLM: 0.10.0
Nvidia Driver: 535.129.03
OS: Ubuntu 22.04
triton-inference-server backend: tensorrtllm_backend
### Who can help?
_No response_
### I…
-
This RFC proposes improvements to the management of Low-Rank Adaptation (LoRA) adapters in vLLM to make it more suitable for production environments. It aims to address several pain points observed …
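One recurring pain point when serving many adapters in production is keeping only the hot ones resident in memory. As a minimal sketch of one such mechanism (the class and method names here are hypothetical, not vLLM's actual API), an LRU policy bounding the number of loaded adapters could look like this:

```python
from collections import OrderedDict


class LoRAAdapterCache:
    """Illustrative LRU cache for LoRA adapters (not vLLM's real API).

    Keeps at most `capacity` adapters resident; when a new adapter is
    loaded past capacity, the least recently used one is evicted.
    """

    def __init__(self, capacity: int):
        self.capacity = capacity
        self._adapters: "OrderedDict[str, object]" = OrderedDict()

    def get(self, name: str, loader):
        """Return the adapter, calling `loader()` to load it on a miss."""
        if name in self._adapters:
            self._adapters.move_to_end(name)    # mark as most recently used
            return self._adapters[name]
        adapter = loader()                      # e.g. read weights from disk
        self._adapters[name] = adapter
        if len(self._adapters) > self.capacity:
            self._adapters.popitem(last=False)  # evict least recently used
        return adapter

    def resident(self):
        """Names of currently loaded adapters, oldest first."""
        return list(self._adapters)
```

Under this policy a request for an adapter refreshes its position, so frequently requested adapters stay resident while cold ones are evicted first.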
-
### System Info
2X L4 GPUs
Docker Image:
nvcr.io/nvidia/tritonserver:24.06-trtllm-python-py3
### Who can help?
@juney-nvidia @kaiyux
### Information
- [ ] The official example sc…
-
Run the command below:
```
python -m vllm.entrypoints.api_server \
  --model meta-llama/Llama-2-7b-hf \
  --enable-lora \
  --lora-modules sql-lora=~/.cache/huggingface/hub/models--yard1--llama-2-7b-…
```
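Once a server like the one above is up, it is queried over HTTP. A hedged sketch of building such a request follows; the `/generate` endpoint, the field names, and the default port 8000 are assumptions taken from vLLM's example api_server and may differ across versions:

```python
import json
import urllib.request

# Illustrative request body for the example api_server's /generate
# endpoint; exact field names depend on the vLLM version in use.
payload = {
    "prompt": "Translate to SQL: list all users",
    "max_tokens": 64,
    "temperature": 0.0,
}

req = urllib.request.Request(
    "http://localhost:8000/generate",
    data=json.dumps(payload).encode("utf-8"),
    headers={"Content-Type": "application/json"},
)
# urllib.request.urlopen(req) would send the request once the server
# from the command above is actually running.
```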
-
RTX 4090 (24 GB),
Qwen-7B-Chat.
The model loads OK:
```
model_config = ModelConfig(lora_infos={
"lora_1": conf['lora_1'],
"lora_2": conf['lora_2'],
})
model = ModelFactory.from_huggingface(conf['b…
```
-
Here is the development roadmap for 2024 Q3. Contributions and feedback are welcome.
## Server API
- [ ] Add APIs for using the inference engine in a single script without launching a separate se…
-
I can't explain why, but with the same model, the same base model, and exactly the same prompt, the degree of style transfer from this custom node is far from what HF's online image generation produces. Why is that? Also, long prompts seem to weaken the effect of the LoRA.
-
Hi, thanks for your wonderful work.
I am struggling to use my LoRA-tuned model.
I followed these steps:
1. Fine-tuning with LoRA
- base model: Undi95/Meta-Llama-3-8B-Instruct-hf
- llama3 …