-
Thank you for all this work!
In the book, Chapter 12, page 209, where a "Hierarchical Self-Attention Network" (HAN) model was introduced to handle heterogeneous graphs, the reference [5] (J. Liu, …
-
(echomimic_v2) Z:\AI\echomimic_v2-main>python app.py
A matching Triton is not available, some optimizations will not be enabled
Traceback (most recent call last):
File "Z:\Users\Administrator\min…
-
Hi @lucidrains,
Thanks for creating this wonderful package as well as `x-transformers`. I wanted to understand why rotary embeddings seem to be slower for me than absolute positional embeddings. I'm …
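For what it's worth, here is a minimal timing sketch of the comparison I have in mind (the shapes, the learned absolute-embedding baseline, and the `measure` helper are placeholders for this sketch; it assumes the `RotaryEmbedding` / `rotate_queries_or_keys` API from rotary-embedding-torch):

```python
import time
import torch
from rotary_embedding_torch import RotaryEmbedding

# placeholder benchmark settings for this sketch
batch, heads, seq, dim_head = 8, 8, 1024, 64
device = 'cuda' if torch.cuda.is_available() else 'cpu'

q = torch.randn(batch, heads, seq, dim_head, device=device)
k = torch.randn(batch, heads, seq, dim_head, device=device)

rotary = RotaryEmbedding(dim=dim_head).to(device)
abs_pos = torch.nn.Embedding(seq, dim_head).to(device)  # learned absolute baseline
positions = torch.arange(seq, device=device)

def measure(fn, iters=100):
    # warm up, then report average per-call latency in seconds
    for _ in range(10):
        fn()
    if device == 'cuda':
        torch.cuda.synchronize()
    start = time.perf_counter()
    for _ in range(iters):
        fn()
    if device == 'cuda':
        torch.cuda.synchronize()
    return (time.perf_counter() - start) / iters

rotary_s = measure(lambda: (rotary.rotate_queries_or_keys(q),
                            rotary.rotate_queries_or_keys(k)))
absolute_s = measure(lambda: (q + abs_pos(positions), k + abs_pos(positions)))
print(f'rotary: {rotary_s * 1e3:.3f} ms, absolute: {absolute_s * 1e3:.3f} ms')
```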
-
# attention
q = q * self.scale  # scale queries (self.scale, typically 1/sqrt(d))
attn_logits = torch.einsum('bnd,bld->bln', q, k)  # logits indexed (batch, key position l, query position n)
attn = self.softmax(attn_logits)
attn…
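For context, a self-contained version of that einsum pattern (the shapes, the 1/sqrt(d) scale, and the softmax dimension below are my assumptions for illustration, not necessarily what the original module uses):

```python
import torch

batch, query_len, key_len, dim = 2, 4, 6, 32  # made-up shapes for illustration

q = torch.randn(batch, query_len, dim)
k = torch.randn(batch, key_len, dim)
v = torch.randn(batch, key_len, dim)

scale = dim ** -0.5                                 # the usual 1/sqrt(d) scaling
q = q * scale
attn_logits = torch.einsum('bnd,bld->bln', q, k)    # (batch, key_len, query_len)
attn = attn_logits.softmax(dim=1)                   # normalize over key positions (dim 1 here)
out = torch.einsum('bln,bld->bnd', attn, v)         # (batch, query_len, dim)
print(out.shape)  # torch.Size([2, 4, 32])
```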
-
### 🚀 The feature, motivation and pitch
Llama 3.2 Vision (Mllama) models require the model runner to be an "Encoder_Decoder_Model_Runner",
which includes:
1. prepare "encoder_seq_lens" and "encoder_seq_len…
-
## Description
I'm benchmarking naive FlashAttention in `Jax` vs. the Pallas version of [`FA3`](https://github.com/jax-ml/jax/blob/7b9914d711593dca8725d46aa1dadb2194284519/jax/experimental/pallas…
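The "naive" baseline is roughly the following (a minimal sketch in plain `jax.numpy`, not my exact benchmark code; the shapes and dtype below are placeholders):

```python
import jax
import jax.numpy as jnp

def naive_attention(q, k, v, causal=False):
    # q, k, v: (batch, heads, seq, head_dim); materializes the full (seq x seq) logits
    scale = q.shape[-1] ** -0.5
    logits = jnp.einsum('bhqd,bhkd->bhqk', q, k) * scale
    if causal:
        seq = q.shape[2]
        mask = jnp.tril(jnp.ones((seq, seq), dtype=bool))
        logits = jnp.where(mask, logits, -jnp.inf)
    weights = jax.nn.softmax(logits, axis=-1)
    return jnp.einsum('bhqk,bhkd->bhqd', weights, v)

# placeholder shapes/dtype for the benchmark
kq, kk, kv = jax.random.split(jax.random.PRNGKey(0), 3)
shape = (1, 8, 2048, 64)
q = jax.random.normal(kq, shape, dtype=jnp.bfloat16)
k = jax.random.normal(kk, shape, dtype=jnp.bfloat16)
v = jax.random.normal(kv, shape, dtype=jnp.bfloat16)

out = jax.jit(naive_attention)(q, k, v)
print(out.shape)  # (1, 8, 2048, 64)
```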
-
When running the demo, whether via the transformer or the modelscope method, the model is automatically downloaded to .cache/huggingface, and then it fails with AssertionError: Only Support Self-Attention Currently.
-
Hi, I want to know more about the self-attention in your work. Why is this attention necessary in your transformer for stereo depth estimation? How does self-attention contribute to depth estimation? Wh…
-
### 🐛 Describe the bug
Hi, I was testing FlexAttention by comparing its output with that of `nn.MultiheadAttention` and `torch.nn.functional.scaled_dot_product_attention`. In the end, I tracked down …
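A minimal version of that comparison looks like this (a sketch, not my exact repro; it sticks to `flex_attention` vs. `scaled_dot_product_attention` with default arguments and float32, and assumes a PyTorch build that ships FlexAttention, i.e. 2.5+):

```python
import torch
import torch.nn.functional as F
from torch.nn.attention.flex_attention import flex_attention

torch.manual_seed(0)
device = 'cuda' if torch.cuda.is_available() else 'cpu'
batch, heads, seq, head_dim = 2, 4, 128, 64  # placeholder shapes

q = torch.randn(batch, heads, seq, head_dim, device=device)
k = torch.randn(batch, heads, seq, head_dim, device=device)
v = torch.randn(batch, heads, seq, head_dim, device=device)

# with no score_mod / block_mask, FlexAttention should reduce to plain SDPA
out_flex = flex_attention(q, k, v)
out_sdpa = F.scaled_dot_product_attention(q, k, v)

print('max abs diff:', (out_flex - out_sdpa).abs().max().item())
print('allclose:', torch.allclose(out_flex, out_sdpa, atol=1e-5, rtol=1e-5))
```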
-
In your paper:
![image](https://github.com/user-attachments/assets/90522342-f265-4852-b69b-77c35cad1095)
But in your code:
class MultiHeadSelfAttention(nn.Module):
def __init__(self, dim, num_heads):
s…