-
### System Info
CPU: x86_64
GPU: NVIDIA A10
TensorRT branch: main
commit id: cad22332550eef9be579e767beb7d605dd96d6f3
CUDA:
NVIDIA-SMI 470.82.01 Driver Version: 470.82.01 CUDA Version: …
-
## Description
Using TRT-LLM to generate a LLaMA classification model engine. I have two similar scripts to generate the engine: the first is a raw script, the second is based on the example/llama/build.sh script.
Howev…
-
Hi,
Thanks for your outstanding work. I have tested the quantized model using the W4A16 kernel on the WikiText2 dataset. Specifically, the WikiText2 validation dataset is split into non-overlapping…
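For reference, a minimal sketch of how such a non-overlapping perplexity evaluation is commonly set up; the model id, the `datasets`/`transformers` loading path, and the 2048-token window are my assumptions, not details from the original report:

```python
# Hypothetical sketch: perplexity over non-overlapping windows of the
# WikiText-2 validation split. Model id and window size are placeholders.
import torch
from datasets import load_dataset
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "meta-llama/Llama-2-7b-hf"  # placeholder; not from the report
tok = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype=torch.float16, device_map="auto"
).eval()

text = "\n\n".join(
    load_dataset("wikitext", "wikitext-2-raw-v1", split="validation")["text"]
)
ids = tok(text, return_tensors="pt").input_ids

seq_len, nlls = 2048, []
for i in range(0, ids.size(1) - seq_len + 1, seq_len):  # non-overlapping
    chunk = ids[:, i : i + seq_len].to(model.device)
    with torch.no_grad():
        # labels == input_ids gives mean cross-entropy over the window
        nlls.append(model(chunk, labels=chunk).loss)
print("ppl:", torch.exp(torch.stack(nlls).mean()).item())
```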
-
### 🚀 The feature, motivation and pitch
I'm working on applications that must run locally on resource-limited HW. Therefore, quantization becomes essential. Such applications need multimodal vi…
-
The ViT used is BAAI's EVA-ViT; EVA adds a 2D rotary position embedding (RoPE) on top of the ViT. Can this 2D positional encoding better handle higher-resolution images?
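For intuition, here is a minimal sketch of what a 2D RoPE over a patch grid looks like; this is my reading of the usual construction (half of each head's dimensions rotate with the patch row index, the other half with the column index), not code from the EVA source:

```python
# Hedged sketch of 2D RoPE: per-axis 1D rotations over a patch grid.
import torch

def rope_1d(x, pos, base=10000.0):
    # x: (..., d) with d even; pos: (...,) integer positions along one axis
    d = x.shape[-1]
    freqs = base ** (-torch.arange(0, d, 2, dtype=torch.float32) / d)
    ang = pos[..., None].float() * freqs          # (..., d/2)
    cos, sin = ang.cos(), ang.sin()
    x1, x2 = x[..., 0::2], x[..., 1::2]
    out = torch.empty_like(x)
    out[..., 0::2] = x1 * cos - x2 * sin
    out[..., 1::2] = x1 * sin + x2 * cos
    return out

def rope_2d(x, rows, cols):
    # x: (n_patches, d); rows/cols: (n_patches,) grid coordinates.
    # First half of dims encodes the row axis, second half the column axis,
    # so relative position extrapolates independently per axis.
    d = x.shape[-1] // 2
    return torch.cat([rope_1d(x[:, :d], rows), rope_1d(x[:, d:], cols)], dim=-1)

# Example: a 4x4 patch grid with 64-dim queries
h = w = 4
r, c = torch.meshgrid(torch.arange(h), torch.arange(w), indexing="ij")
q = torch.randn(h * w, 64)
q_rot = rope_2d(q, r.flatten(), c.flatten())
```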
-
### The model to consider.
Thanks to the vllm team for their efforts.
I am currently preparing to optimize the inference performance of WeMM; the link is provided below.
https://huggingface.co/f…
-
This pattern of mixing numpy and MLX inside the model's forward pass will really slow things down: it forces a synchronization at each layer and breaks asynchronous evaluation:
https://github.com/Blaiz…
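To illustrate the concern (a hedged sketch, not the code at the link above): converting a lazy MLX array to numpy mid-forward forces evaluation at every layer, while staying in MLX keeps the whole graph lazy until one final evaluation:

```python
import mlx.core as mx
import numpy as np

def forward_mixed(x, weights):
    # Anti-pattern: np.array(h) forces MLX to evaluate the pending graph
    # on every iteration, serializing the forward pass layer by layer.
    h = x
    for w in weights:
        h = np.tanh(np.array(h) @ np.array(w))  # sync point each layer
        h = mx.array(h)
    return h

def forward_mlx(x, weights):
    # Pure MLX: ops stay lazy and can be evaluated together at the end.
    h = x
    for w in weights:
        h = mx.tanh(h @ w)
    return h

weights = [mx.random.normal((64, 64)) for _ in range(8)]
x = mx.random.normal((1, 64))
out_slow = forward_mixed(x, weights)  # eight forced synchronizations
out_fast = forward_mlx(x, weights)
mx.eval(out_fast)                     # single evaluation of the fused graph
```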
-
Attempting to generate with Mistral Small causes this error:
```
---------------------------------------------------------------------------
RuntimeError Traceback (most r…
```
-
### Is there an existing issue for this?
- [X] I have searched the existing issues
### Current Behavior
From what I am seeing, for the standard MultiHeadAttention there is a procedure:
input -> …
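For reference, a generic sketch of the standard MultiHeadAttention pipeline the report appears to describe; the procedure above is truncated, so the exact steps here are my assumptions:

```python
# Standard MHA: input -> Q/K/V projections -> split heads
# -> scaled dot-product attention -> merge heads -> output projection.
import torch
import torch.nn.functional as F

def multi_head_attention(x, w_q, w_k, w_v, w_o, n_heads):
    b, t, d = x.shape
    hd = d // n_heads
    def split(y):  # (b, t, d) -> (b, n_heads, t, hd)
        return y.view(b, t, n_heads, hd).transpose(1, 2)
    q, k, v = split(x @ w_q), split(x @ w_k), split(x @ w_v)
    att = F.softmax(q @ k.transpose(-2, -1) / hd ** 0.5, dim=-1)
    out = (att @ v).transpose(1, 2).reshape(b, t, d)  # merge heads
    return out @ w_o
```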
-
This can be reproduced by cloning the latest Megatron-LM and enabling transformer_engine for `--transformer-impl` instead of using the local implementation.
The experiments are run in a `nvcr.io/nvidia/pyt…