-
Hello, thank you for a great project!
I am getting this error when using ALiBi or RoPE positional encoding in a Transformer NMT model from OpenNMT-py:
KeyError: 'encoder.embeddings.make_embedding…
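Not related to the KeyError itself, but for anyone comparing the two options: ALiBi does not add anything to the embeddings; it adds a fixed, head-specific linear bias to the attention scores. A minimal sketch of that idea, assuming PyTorch and a causal decoder (not OpenNMT-py's actual implementation):

```python
import torch

def alibi_bias(num_heads: int, seq_len: int) -> torch.Tensor:
    """Minimal ALiBi bias: a per-head linear penalty added to attention scores.

    Head slopes follow the geometric sequence from Press et al. (2021).
    Illustration only, not OpenNMT-py's code.
    """
    # Slopes 2^(-8/num_heads), 2^(-16/num_heads), ... one per head.
    slopes = torch.tensor([2.0 ** (-8.0 * (h + 1) / num_heads) for h in range(num_heads)])
    positions = torch.arange(seq_len)
    # distance[i, j] = j - i; for past keys (j < i) this is negative, so the
    # bias penalizes distant positions. Future positions are masked anyway in a causal decoder.
    distance = positions[None, :] - positions[:, None]              # (seq_len, seq_len)
    return slopes[:, None, None] * distance[None, :, :]             # (num_heads, seq_len, seq_len)

# Usage: scores = q @ k.transpose(-1, -2) / d_head**0.5 + alibi_bias(n_heads, T)
```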
-
- https://arxiv.org/abs/2106.12566
- 2021
The attention module, a core component of the Transformer, cannot scale efficiently to long sequences because of its quadratic complexity.
Many studies focus on approximating the dot-then-exponentiate softmax function of the original attention, quadratic…
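The quadratic cost comes from materializing the full T×T score matrix. One common way around it, shown here only as an illustration of the general kernel trick and not as the linked paper's method, is to pick a feature map φ so that the products can be re-associated and computed in time linear in sequence length:

```python
import torch
import torch.nn.functional as F

def linear_attention(q, k, v):
    """Kernelized attention sketch: softmax(QK^T)V is replaced by
    phi(Q) (phi(K)^T V), avoiding the T x T score matrix.

    Uses the elu(x)+1 feature map (as in "Transformers are RNNs");
    illustrative only, not the approximation proposed in the paper above.
    """
    phi_q = F.elu(q) + 1.0                                  # (B, T, D), positive features
    phi_k = F.elu(k) + 1.0                                  # (B, T, D)
    kv = torch.einsum("btd,bte->bde", phi_k, v)             # (B, D, E), cost O(T * D * E)
    z = 1.0 / (torch.einsum("btd,bd->bt", phi_q, phi_k.sum(dim=1)) + 1e-6)  # normalizer
    return torch.einsum("btd,bde,bt->bte", phi_q, kv, z)    # (B, T, E)
```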
-
Hi, great work. I've benefited a lot from kaolin!
I've read the paper "DMTet", which describes PVCNN as the input encoder. However, I only find "MLP + positional encoding" in the kaolin implementation.
…
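For context, the "MLP + positional encoding" pattern for point inputs usually means a NeRF-style frequency encoding of the coordinates followed by a small MLP, roughly like the sketch below. This is only an illustration of the pattern, not kaolin's DMTet implementation; the layer sizes are assumptions.

```python
import torch
import torch.nn as nn

class FourierEncodedMLP(nn.Module):
    """Sketch of an "MLP + positional encoding" point encoder:
    each 3D coordinate is lifted with sin/cos at several frequencies,
    then fed through an MLP. Not kaolin's actual code."""

    def __init__(self, num_freqs: int = 6, hidden: int = 128, out_dim: int = 32):
        super().__init__()
        self.register_buffer("freqs", 2.0 ** torch.arange(num_freqs))   # 1, 2, 4, ...
        in_dim = 3 + 3 * 2 * num_freqs                                   # xyz + sin/cos features
        self.mlp = nn.Sequential(
            nn.Linear(in_dim, hidden), nn.ReLU(),
            nn.Linear(hidden, out_dim),
        )

    def forward(self, xyz: torch.Tensor) -> torch.Tensor:               # xyz: (N, 3)
        angles = xyz[..., None] * self.freqs                            # (N, 3, num_freqs)
        feats = torch.cat(
            [xyz, torch.sin(angles).flatten(-2), torch.cos(angles).flatten(-2)], dim=-1
        )
        return self.mlp(feats)
```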
-
The positional encoding matrix shown in equation (7) of the paper is not considered in the class Global_SAN that defines the self-attentive network. I think that without it there is no mechanism to …
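Without knowing the exact form of equation (7), here is a minimal sketch of what injecting such a matrix before the self-attention could look like. The class name, dimensions, and the choice of a learned embedding are hypothetical; this is not the repository's code.

```python
import torch
import torch.nn as nn

class SelfAttentiveWithPE(nn.Module):
    """Hypothetical sketch: add a positional encoding matrix to the inputs of a
    self-attentive block so that token order influences the attention scores."""

    def __init__(self, d_model: int = 256, max_len: int = 512, n_heads: int = 4):
        super().__init__()
        self.pos = nn.Embedding(max_len, d_model)                 # learned PE matrix
        self.attn = nn.MultiheadAttention(d_model, n_heads, batch_first=True)

    def forward(self, x: torch.Tensor) -> torch.Tensor:          # x: (B, T, d_model)
        idx = torch.arange(x.size(1), device=x.device)
        x = x + self.pos(idx)                                     # inject positions before attention
        out, _ = self.attn(x, x, x)
        return out
```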
-
I see, thank you for providing more context. Let me summarize my understanding of your approach:
1. You've flattened JSON representations of ASTs into a tabular format.
2. This table has many colu…
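To check my understanding of step 1, I picture the flattening roughly like the sketch below. The column names (node_id, parent_id, node_type, value) and the recursive walk are my assumptions, not necessarily your schema.

```python
from typing import Any, Dict, List, Optional

def flatten_ast(node: Dict[str, Any], parent_id: int = -1,
                rows: Optional[List[dict]] = None) -> List[dict]:
    """Hypothetical sketch: walk a JSON AST and emit one table row per node."""
    if rows is None:
        rows = []
    node_id = len(rows)
    rows.append({
        "node_id": node_id,
        "parent_id": parent_id,
        "node_type": node.get("type"),
        "value": node.get("value"),
    })
    for child in node.get("children", []):       # depth-first over child nodes
        flatten_ast(child, node_id, rows)
    return rows

# Example: flatten_ast({"type": "BinOp", "children": [{"type": "Num", "value": 1}]})
```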
-
Hi there! It seems that the positional encoding in SAM is too complex for torch_pruning.
https://github.com/czg1225/SlimSAM/issues/10
In fact, there is an infinite loop at the **_fix_dependen…
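If it helps with debugging: SAM's prompt encoder builds its positional encoding on the fly from a fixed random Gaussian projection stored as a buffer rather than as a prunable weight, roughly of the form sketched below, which is presumably why the dependency tracing struggles. This is only an illustration under that assumption, not SAM's or torch_pruning's actual code.

```python
import torch
import torch.nn as nn

class RandomFourierPE(nn.Module):
    """Sketch of a SAM-style positional encoding: normalized 2D coordinates are
    projected by a fixed random Gaussian matrix and mapped through sin/cos.
    The projection is a buffer, not a trainable/prunable parameter."""

    def __init__(self, num_feats: int = 64, scale: float = 1.0):
        super().__init__()
        self.register_buffer("gaussian", scale * torch.randn(2, num_feats))

    def forward(self, coords: torch.Tensor) -> torch.Tensor:     # coords in [0, 1], shape (..., 2)
        proj = 2.0 * torch.pi * coords @ self.gaussian            # (..., num_feats)
        return torch.cat([torch.sin(proj), torch.cos(proj)], dim=-1)
```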
-
In your paper, you say
> Position Interpolation (PI, [Chen et al., 2023] and [kaiokendev, 2023]) introduces a modification to the rotary positional encoding scheme that enables fine-tuning for 32K…
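For concreteness, my understanding of PI is that it simply rescales the position index before the rotary angles are computed, so an extended context maps back into the trained position range. A minimal sketch of that idea (not the paper's implementation):

```python
import torch

def rope_angles(positions: torch.Tensor, dim: int, base: float = 10000.0,
                scale: float = 1.0) -> torch.Tensor:
    """Rotary-embedding angles with Position Interpolation: positions are
    multiplied by `scale` = trained_context / target_context (< 1)."""
    inv_freq = 1.0 / (base ** (torch.arange(0, dim, 2).float() / dim))   # (dim/2,)
    return (positions.float() * scale)[:, None] * inv_freq[None, :]       # (T, dim/2)

# e.g. extending a 2K-trained model to 32K: scale = 2048 / 32768
angles = rope_angles(torch.arange(32768), dim=128, scale=2048 / 32768)
# cos/sin of `angles` then rotate query/key pairs exactly as in standard RoPE.
```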
-
### Your current environment
Name: vllm
Version: 0.6.3.post2.dev171+g890ca360
### Model Input Dumps
_No response_
### 🐛 Describe the bug
I used the interface from this vllm repository …
-
Hey, I noticed that you did not use positional encoding in the model, but the original Transformer model used the sinusoidal (sine/cosine) positional encoding. Why didn't you use it? Is positional encoding not useful here?
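For reference, the encoding I mean is the sinusoidal one from "Attention Is All You Need"; a quick sketch just to make the question concrete (assumes an even d_model):

```python
import torch

def sinusoidal_pe(seq_len: int, d_model: int) -> torch.Tensor:
    """Sinusoidal positional encoding from Vaswani et al. (2017):
    PE(pos, 2i) = sin(pos / 10000^(2i/d_model)), PE(pos, 2i+1) = cos(...)."""
    pos = torch.arange(seq_len).float()[:, None]                         # (T, 1)
    div = 10000.0 ** (torch.arange(0, d_model, 2).float() / d_model)     # (d_model/2,)
    pe = torch.zeros(seq_len, d_model)
    pe[:, 0::2] = torch.sin(pos / div)
    pe[:, 1::2] = torch.cos(pos / div)
    return pe   # typically added to token embeddings: x = embed(tokens) + pe[:T]
```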
-
### Is there an existing issue for this?
- [X] I have searched the existing issues
### Current Behavior
As far as I can see, for the standard MultiHeadAttention there is a procedure:
input -> …
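To pin down the procedure I mean, here is the standard flow written out as a sketch (my reading of the textbook layer, not this library's code):

```python
import torch
import torch.nn.functional as F

def multi_head_attention(x, w_q, w_k, w_v, w_o, num_heads):
    """Textbook multi-head attention:
    input -> Q/K/V projections -> split heads -> scaled dot-product softmax
          -> weighted sum of V -> concat heads -> output projection."""
    B, T, D = x.shape
    d_head = D // num_heads

    def split(t):                     # (B, T, D) -> (B, heads, T, d_head)
        return t.view(B, T, num_heads, d_head).transpose(1, 2)

    q, k, v = split(x @ w_q), split(x @ w_k), split(x @ w_v)
    scores = q @ k.transpose(-2, -1) / d_head ** 0.5          # (B, heads, T, T)
    out = F.softmax(scores, dim=-1) @ v                       # (B, heads, T, d_head)
    out = out.transpose(1, 2).reshape(B, T, D)                # concat heads
    return out @ w_o                                          # output projection
```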