-
### System Info
GPU: NVIDIA A100
Driver Version: 545.23.08
CUDA: 12.3
versions:
https://github.com/NVIDIA/TensorRT-LLM.git (5fa9436) (latest version)
https://github.com/trit…
-
### Reminder
- [X] I have read the README and searched the existing issues.
### Reproduction
Hi, I am quite new to the LLaMA-Factory framework, and I am not able to find the config.yaml for LongLoRA and st…
-
I have a question about the paper's results.
![image](https://github.com/Infini-AI-Lab/TriForce/assets/50622684/d69216c5-1b99-466e-b1e6-b1134b140abc)
Does Retrieval w/o Hierarchy test with normal speculati…
bxyb updated 5 months ago
-
### System Info
3090 server
### Who can help?
_No response_
### Information
- [ ] The official example scripts
- [ ] My own modified scripts
### Tasks
- [ ] An officially supported task in the …
-
### System Info
- 2 × NVIDIA A100 80GB
- Libraries
- TensorRT-LLM 0.11.0
- Driver Version: 525.105.17
- CUDA Version: 12.4
### Who can help?
@byshiue
### Information
- [X] The official exa…
-
### System Info
4x NVIDIA H100, TensorRT-LLM backend 0.9.0
### Who can help?
@Tracin
### Information
- [X] The official example scripts
- [ ] My own modified scripts
### Tasks
-…
-
### System Info
- CPU Architecture: x86_64
- GPU: NVIDIA H100
- TensorRT-LLM v0.10.0
### Who can help?
_No response_
### Information
- [ ] The official example scripts
- [ ] My own modified scri…
-
### System Info
- x86_64
- GPU Mem: 640GB
- CPU Mem: 1.5TB
- 8 * NVIDIA H100
- TensorRT-LLM Version: `0.12.0.dev2024072301`
- TensorRT-LLM Commit `5fa9436e17c2f9aeace070f49aa645d2577f676b`
- T…
-
Modified: `tensorrt_llm/models/chatglm/model.py`

```python
def use_lora(self, lora_config: LoraConfig):
    trtllm_modules_to_hf_modules = {
        "attn_qkv": "query_key_value",
        "att…
```
-
I'm trying to run the TensorRT version of the Docker container according to the instructions, but I get a segfault whenever I attempt to transcribe any audio. The same audio works with the Faster whi…