-
### 📚 The doc issue
https://docs.vllm.ai/en/latest/models/lora.html describes the steps to load a LoRA model.
```
python -m vllm.entrypoints.openai.api_server \
    --model meta-llama/Llama-2-7b…
```
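On that page, an adapter is registered at startup via `--lora-modules` and then selected per request through the `model` field of the OpenAI-compatible API. A minimal sketch of querying it, assuming the server above is running on the default port 8000 and serves an adapter named `sql-lora` as in the docs:
```python
# Sketch: querying a LoRA adapter through the OpenAI-compatible server.
# Assumes the server above is up on localhost:8000 and an adapter was
# registered under the name "sql-lora" via --lora-modules (as in the docs).
import requests

resp = requests.post(
    "http://localhost:8000/v1/completions",
    json={
        "model": "sql-lora",  # adapter name, not the base model
        "prompt": "Write a SQL query that counts users by country.",
        "max_tokens": 64,
    },
)
print(resp.json()["choices"][0]["text"])
```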
-
### Expected Behavior
1. The number of LoRA weights matches the number of LoRAs applied to the generated image
2. The LoRA weight is updated correctly when "Reuse parameters" is pressed
3. The LoRA weight in …
-
### System Info
GPU Name: NVIDIA A800
TensorRT-LLM: 0.10.0
Nvidia Driver: 535.129.03
OS: Ubuntu 22.04
triton-inference-server backend: tensorrtllm_backend
### Who can help?
_No response_
### I…
-
Run the code below:
```
python -m vllm.entrypoints.api_server \
    --model meta-llama/Llama-2-7b-hf \
    --enable-lora \
    --lora-modules sql-lora=~/.cache/huggingface/hub/models--yard1--llama-2-7b-…
```
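The same adapter can also be exercised through vLLM's offline Python API, which is how the LoRA docs demonstrate it; a minimal sketch, with the long cached adapter path shortened to a placeholder:
```python
# Sketch: loading the sql-lora adapter via vLLM's offline API, following
# the pattern in the vLLM LoRA docs. Replace the placeholder path with the
# real adapter location from the command above.
from vllm import LLM, SamplingParams
from vllm.lora.request import LoRARequest

llm = LLM(model="meta-llama/Llama-2-7b-hf", enable_lora=True)

outputs = llm.generate(
    ["Write a SQL query that lists all tables."],
    SamplingParams(max_tokens=64),
    # (adapter name, unique integer id, local path to adapter weights)
    lora_request=LoRARequest("sql-lora", 1, "/path/to/sql-lora"),
)
print(outputs[0].outputs[0].text)
```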
-
RTX 4090 24 GB, Qwen-7B-Chat loads OK:
```
model_config = ModelConfig(lora_infos={
    "lora_1": conf['lora_1'],
    "lora_2": conf['lora_2'],
})
model = ModelFactory.from_huggingface(conf['b…
```
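For context, `lora_infos` appears to map adapter names to checkpoint locations; a hypothetical `conf` for the snippet above, with names and paths invented purely for illustration:
```python
# Hypothetical `conf` for the snippet above. The adapter names and paths
# are invented for illustration and are not taken from the report.
conf = {
    "base_model": "Qwen/Qwen-7B-Chat",
    "lora_1": "/models/qwen-7b-chat-lora-task-a",
    "lora_2": "/models/qwen-7b-chat-lora-task-b",
}
```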
-
Hi, thanks for your wonderful work.
I am struggling to use my LoRA-tuned model.
I followed these steps:
1. Fine-tuning with LoRA (see the sketch below)
- base model: Undi95/Meta-Llama-3-8B-Instruct-hf
- llama3 …
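A minimal sketch of step 1, assuming the Hugging Face PEFT stack (the hyperparameters are illustrative, not taken from the report):
```python
# Sketch of LoRA fine-tuning (step 1) with Hugging Face PEFT. The rank,
# alpha, and target modules below are illustrative assumptions.
from transformers import AutoModelForCausalLM
from peft import LoraConfig, get_peft_model

base = AutoModelForCausalLM.from_pretrained("Undi95/Meta-Llama-3-8B-Instruct-hf")
lora_cfg = LoraConfig(
    r=16,
    lora_alpha=32,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
    task_type="CAUSAL_LM",
)
model = get_peft_model(base, lora_cfg)
model.print_trainable_parameters()  # sanity check before training
# ... run training, then save only the adapter weights:
# model.save_pretrained("llama3-8b-instruct-lora")
```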
-
I have faced an error with the vLLM framework when I tried to run inference on an Unsloth fine-tuned Llama3-8B model...
### Error:
```shell
(venv) ubuntu@ip-192-168-68-10:~/ans/vllm-server$ python -O -u -m vl…
```
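A workaround that often helps when vLLM refuses an Unsloth-produced adapter is to merge the adapter into the base weights first and serve the merged checkpoint; a minimal sketch using PEFT, where the base model id and all paths are assumptions:
```python
# Sketch: merging a LoRA adapter into the base model with PEFT so vLLM can
# load it as a plain checkpoint. The base model id and paths are assumptions.
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel

base = AutoModelForCausalLM.from_pretrained("meta-llama/Meta-Llama-3-8B")
model = PeftModel.from_pretrained(base, "/path/to/unsloth-adapter")
model = model.merge_and_unload()  # fold the LoRA deltas into the base weights
model.save_pretrained("/path/to/merged-model")
AutoTokenizer.from_pretrained("meta-llama/Meta-Llama-3-8B").save_pretrained(
    "/path/to/merged-model"
)
```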
-
### Python Version
```shell
Python 3.10.12 (main, Nov 20 2023, 15:14:05) [GCC 11.4.0]
```
### Pip Freeze
```shell
absl-py==2.1.0
annotated-types==0.7.0
anyio==4.0.0
argon2-cffi==23.1.…
```
-
### Problem statement:
In a production system, there should be an API to add/remove fine-tuned weights dynamically. The inference caller should not have to specify the LoRA location with each call.
Cur…
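For reference, newer vLLM versions expose runtime adapter management of roughly this shape when the server is started with `VLLM_ALLOW_RUNTIME_LORA_UPDATING=True`; a minimal sketch (endpoint availability depends on the vLLM version, and the adapter name/path are assumptions):
```python
# Sketch of the requested add/remove API, as exposed by newer vLLM versions
# when VLLM_ALLOW_RUNTIME_LORA_UPDATING=True is set. Name/path are assumptions.
import requests

BASE = "http://localhost:8000"

# Register a new adapter at runtime, without restarting the server.
requests.post(f"{BASE}/v1/load_lora_adapter", json={
    "lora_name": "sql-lora",
    "lora_path": "/path/to/sql-lora",
})

# Later, remove it again.
requests.post(f"{BASE}/v1/unload_lora_adapter", json={"lora_name": "sql-lora"})
```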
-
### Your current environment
None
### How would you like to use vllm
I hope to deploy the Llama3-70B model on a server with 8 RTX 3090 GPUs. When I enable the enable_lora switch, the system will defi…
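A sketch of the intended setup through vLLM's offline Python API: tensor parallelism across the 8 GPUs plus LoRA enabled. Whether the 70B weights fit on 8 x 24 GB cards depends on quantization and other settings; the LoRA limits below are illustrative:
```python
# Sketch: Llama-3-70B across 8 GPUs with LoRA enabled. max_loras and
# max_lora_rank are illustrative; fitting 70B on 8x24 GB may additionally
# require quantization.
from vllm import LLM

llm = LLM(
    model="meta-llama/Meta-Llama-3-70B",
    tensor_parallel_size=8,
    enable_lora=True,
    max_loras=2,
    max_lora_rank=16,
)
```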