-
Is fast RoPE exactly equivalent to LLaMA's `apply_rotary_pos_emb`? I constructed a test case and found that the results are not exactly equivalent. Is there anything wrong with my test case?
code:
---------…
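For reference (this is not the original test case): one common reason two RoPE implementations disagree elementwise is the pairing convention. The paper rotates adjacent pairs (x_0, x_1), (x_2, x_3), …, while LLaMA/HF-style `rotate_half` pairs dimension i with dimension i + d/2. A minimal pure-Python sketch of the two conventions, assuming the standard base of 10000 (function names here are illustrative, not from any library):

```python
import math

def rope_interleaved(x, pos, base=10000.0):
    """RoPE as in the paper: rotate adjacent pairs (x0,x1), (x2,x3), ..."""
    d = len(x)
    out = [0.0] * d
    for i in range(d // 2):
        theta = pos * base ** (-2 * i / d)
        c, s = math.cos(theta), math.sin(theta)
        out[2 * i] = x[2 * i] * c - x[2 * i + 1] * s
        out[2 * i + 1] = x[2 * i] * s + x[2 * i + 1] * c
    return out

def rope_half_split(x, pos, base=10000.0):
    """RoPE in the LLaMA/HF rotate_half style: pair x_i with x_{i + d/2}."""
    d = len(x)
    half = d // 2
    out = [0.0] * d
    for i in range(half):
        theta = pos * base ** (-2 * i / d)
        c, s = math.cos(theta), math.sin(theta)
        out[i] = x[i] * c - x[i + half] * s
        out[i + half] = x[i] * s + x[i + half] * c
    return out

x = [1.0, 2.0, 3.0, 4.0]
a = rope_interleaved(x, pos=1)
b = rope_half_split(x, pos=1)
print(a)
print(b)  # differs elementwise from a

# Both are orthogonal rotations, so both preserve the vector norm:
norm = lambda v: math.sqrt(sum(t * t for t in v))
print(abs(norm(a) - norm(x)) < 1e-9, abs(norm(b) - norm(x)) < 1e-9)
```

If the fused kernel and `apply_rotary_pos_emb` use different pairings (or the projection weights were not permuted to match), the outputs will differ elementwise even though both are valid rotary encodings.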
-
# URL
- https://arxiv.org/abs/2104.09864
# Affiliations
- Jianlin Su, N/A
- Yu Lu, N/A
- Shengfeng Pan, N/A
- Ahmed Murtadha, N/A
- Bo Wen, N/A
- Yunfeng Liu, N/A
# Abstract
- Position e…
-
### Description of the bug:
Hi @pkgoogle,
I have some questions about the compute graph with TinyLlama.
- I can't see the rotary position encoding in the compute graph; I can only see `tok embedding`.…
-
https://arxiv.org/abs/2104.09864
https://blog.eleuther.ai/rotary-embeddings/
-
I noticed that:
```
def get_rotary_matrix(context_window, embedding_dim):
    R = torch.zeros((context_window, embedding_dim, embedding_dim), requires_grad=False)
    for position in range(context…
```
nkkbr updated
5 months ago
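As a cross-check of the construction in the snippet above (a hedged sketch, not the original code: `rotary_matrix` and `apply_rope` are hypothetical helper names, and the loop body is reconstructed from the standard RoPE definition), the full block-diagonal rotation matrix and the cheaper elementwise form should agree:

```python
import math

def rotary_matrix(pos, d, base=10000.0):
    """Full d x d block-diagonal rotation matrix for one position."""
    R = [[0.0] * d for _ in range(d)]
    for i in range(d // 2):
        theta = pos * base ** (-2 * i / d)
        c, s = math.cos(theta), math.sin(theta)
        R[2 * i][2 * i] = c
        R[2 * i][2 * i + 1] = -s
        R[2 * i + 1][2 * i] = s
        R[2 * i + 1][2 * i + 1] = c
    return R

def apply_rope(x, pos, base=10000.0):
    """Equivalent elementwise form: O(d) work instead of a d x d matmul."""
    d = len(x)
    out = [0.0] * d
    for i in range(d // 2):
        theta = pos * base ** (-2 * i / d)
        c, s = math.cos(theta), math.sin(theta)
        out[2 * i] = x[2 * i] * c - x[2 * i + 1] * s
        out[2 * i + 1] = x[2 * i] * s + x[2 * i + 1] * c
    return out

x = [0.5, -1.0, 2.0, 3.0]
R = rotary_matrix(2, len(x))
via_matmul = [sum(R[r][c] * x[c] for c in range(len(x))) for r in range(len(x))]
via_elementwise = apply_rope(x, 2)
print(all(abs(u - v) < 1e-12 for u, v in zip(via_matmul, via_elementwise)))  # True
```

Because the rotation matrix is sparse (2×2 blocks on the diagonal), most implementations use the elementwise cos/sin form rather than materializing R.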
-
- https://arxiv.org/abs/2104.09864
- 2021
Positional encoding in the Transformer architecture provides supervision for modeling dependencies between elements at different positions in a sequence.
In this work, we investigate various methods of encoding positional information in Transformer-based language models and propose a novel implementation named Rotary Position Embedding (RoPE).
…
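For context, the key property behind RoPE can be written as follows (the standard formulation from the paper, restated here):

```latex
% Queries and keys at positions m, n are rotated before the dot product:
f_q(x_m, m) = R^d_{\Theta, m} W_q x_m, \qquad
f_k(x_n, n) = R^d_{\Theta, n} W_k x_n
% Since R_m^\top R_n = R_{n-m}, the attention score depends only on the
% relative offset m - n:
\langle f_q(x_m, m),\, f_k(x_n, n) \rangle
  = (W_q x_m)^{\top} R^d_{\Theta, n-m} W_k x_n
% with rotation frequencies
\theta_i = 10000^{-2(i-1)/d}, \quad i \in \{1, \dots, d/2\}
```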
e4exp updated
3 years ago
-
Hi, I tried to finetune the Llama2-7b-chat model using Megatron. I downloaded the HF checkpoint and converted it to a GPT Megatron checkpoint, referring to [https://github.com/NVIDIA/Megatron-LM/blob/fe1640a3cc48…
-
[paper](https://arxiv.org/pdf/2104.09864.pdf), [code](https://github.com/huggingface/transformers/blob/v4.28.1/src/transformers/models/roformer/modeling_roformer.py#L318-L343)
## TL;DR
- **I r…
-
### Your current environment
I used version 0.4.3, installed via pip, CUDA version 12.0, A100 GPU.
RuntimeError: t == DeviceType::CUDA INTERNAL ASSERT FAILED
### 🐛 Describe the bug
```
INFO 06-02 03…
```
-
### Your current environment
```text
Versions of relevant libraries:
[pip3] flashinfer==0.0.9+cu121torch2.3
[pip3] numpy==1.26.4
[pip3] nvidia-nccl-cu12==2.20.5
[pip3] sentence-transformers==3.0…
```