-
### 🚀 The feature, motivation and pitch
Thanks for fixing the soft-capping issue of the Gemma 2 models in the last release! I noticed there's still a [comment](https://github.com/vllm-project/vllm/bl…
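For anyone landing here, soft-capping in this context just squashes the logits through a scaled tanh; a minimal sketch with the published Gemma 2 cap values (plain PyTorch for illustration, not vLLM's actual kernel path):

```python
import torch

def soft_cap(logits: torch.Tensor, cap: float) -> torch.Tensor:
    # cap * tanh(logits / cap): keeps logits in (-cap, cap) while staying smooth.
    return cap * torch.tanh(logits / cap)

# Gemma 2's published defaults: 50.0 for attention logits, 30.0 for the LM head.
attn_scores = soft_cap(torch.randn(2, 8, 16, 16), cap=50.0)
lm_logits = soft_cap(torch.randn(2, 16, 256_000), cap=30.0)
```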
-
In our paper we only showed results on causal language models, which use causally masked (decoder) self-attention.
If you'd like to use ALiBi for seq2seq tasks such as translation, speech or T5, o…
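For concreteness, the decoder-side bias referred to above is just a per-head linear penalty on how far back each position attends; a minimal sketch (assuming a power-of-two number of heads so the simple geometric slope formula applies; not the paper's reference code):

```python
import torch

def alibi_bias(n_heads: int, seq_len: int) -> torch.Tensor:
    # Head h gets slope 2**(-8*(h+1)/n_heads); score (i, j) is penalised by slope * (i - j).
    slopes = torch.tensor([2.0 ** (-8.0 * (h + 1) / n_heads) for h in range(n_heads)])
    pos = torch.arange(seq_len)
    dist = (pos[:, None] - pos[None, :]).clamp(min=0)                    # distance back to position j
    bias = -slopes[:, None, None] * dist                                 # [n_heads, seq_len, seq_len]
    bias = bias.masked_fill(pos[None, :] > pos[:, None], float("-inf"))  # causal mask
    return bias  # add to the attention logits before softmax
```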
-
Hi there, thank you for your wonderful work!
I have a few questions I would like to ask:
1. Is it possible to handle the appearance of a new object in a video? Specifically, can we detect a new ob…
-
The feature section lacks interactivity and visual appeal, making it less engaging for visitors and potentially hindering the website's ability to capture attention.
Description of the solution I'd like
…
-
Nice paper on making LLMs fully attention-based. However, I noticed that the largest model discussed in the paper is a 1.5B model.
I wonder if the pattention layer is difficult to tensor parallelize…
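To make the question concrete, here is a toy sketch of how I picture sharding the layer's parameter tokens across tensor-parallel ranks, the way one would split an FFN's hidden dimension (plain softmax for illustration; this is not the repo's code, and the paper's normalization differs). The normalizer is the part that couples the shards and would need an all-reduce:

```python
import torch

torch.manual_seed(0)
d_model, n_param_tokens, world_size = 64, 256, 2

x = torch.randn(8, d_model)                    # input tokens act as queries
k_p = torch.randn(n_param_tokens, d_model)     # learnable "key" parameter tokens
v_p = torch.randn(n_param_tokens, d_model)     # learnable "value" parameter tokens

# Reference: unsharded layer, plain softmax for illustration.
ref = torch.softmax(x @ k_p.t() / d_model**0.5, dim=-1) @ v_p

# Sharded version: each rank holds a slice of the parameter tokens.
num, den = 0.0, 0.0
for k_s, v_s in zip(k_p.chunk(world_size), v_p.chunk(world_size)):
    e = (x @ k_s.t() / d_model**0.5).exp()
    num = num + e @ v_s                        # local unnormalised output
    den = den + e.sum(-1, keepdim=True)        # local normaliser
# The two sums above are the all-reduces a real TP implementation would need;
# the normaliser is the extra coupling compared to a plain row-parallel FFN.
out = num / den
torch.testing.assert_close(out, ref, rtol=1e-4, atol=1e-5)
```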
-
Hey,
How can I get token-level contributions for the search query? This seems like one of the strong benefits of ColBERT for highlighting relevant matches, but for some reason, I can't find any implemen…
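In case it helps to sketch what I mean: with late interaction the document score is a sum of per-query-token MaxSim values, so each query token's contribution (and the doc token it matched) falls out directly. A toy sketch, not anything from this repo:

```python
import torch

def token_contributions(q_emb: torch.Tensor, d_emb: torch.Tensor):
    """Per-query-token MaxSim contributions.
    q_emb: [num_query_tokens, dim], d_emb: [num_doc_tokens, dim], both L2-normalised."""
    sim = q_emb @ d_emb.t()            # [num_query_tokens, num_doc_tokens] cosine similarities
    contrib, matched = sim.max(dim=1)  # MaxSim per query token + index of the doc token it matched
    score = contrib.sum()              # the usual ColBERT document score
    return score, contrib, matched
```

Pairing `contrib` with the query's token strings shows which query tokens drive the score, and `matched` gives the doc token positions to highlight.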
-
Hey, thanks for the great work. I could be wrong, but I feel like there is a disconnect between what is mentioned in the Based paper and what is used in the Figure 2 config for MQAR eval. In the paper…
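For anyone else reading along, here is a toy sketch of the MQAR task as I understand it (a hypothetical generator, not the zoology code the config points at), just so it's clear what the eval measures:

```python
import torch

def make_mqar_batch(batch=4, seq_len=64, num_pairs=8, vocab=64, seed=0):
    """Toy multi-query associative recall data. Keys live in [0, vocab//2),
    values in [vocab//2, vocab); the prefix lists key/value pairs and the
    suffix re-queries each key, where the label is its paired value."""
    g = torch.Generator().manual_seed(seed)
    half = vocab // 2
    x = torch.randint(half, vocab, (batch, seq_len), generator=g)  # filler drawn from the value range
    y = torch.full((batch, seq_len), -100)                         # -100 = ignored by cross-entropy
    for b in range(batch):
        keys = torch.randperm(half, generator=g)[:num_pairs]
        vals = torch.randint(half, vocab, (num_pairs,), generator=g)
        x[b, 0:2 * num_pairs:2] = keys                             # key, value, key, value, ...
        x[b, 1:2 * num_pairs:2] = vals
        pos = 2 * num_pairs + torch.randperm(seq_len - 2 * num_pairs, generator=g)[:num_pairs]
        x[b, pos] = keys                                           # re-query each key once
        y[b, pos] = vals                                           # model must recall the paired value
    return x, y
```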
-
I think having flash attention in `equinox` should be treated as a critical issue, considering it is already built into torch natively.
While XLA is supposed to (in theory) do some of the fusion, and possibly …
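For what it's worth, until something lands in `equinox`, this is roughly what can be called from inside an `eqx.Module` today; a minimal sketch assuming a recent JAX that exposes `jax.nn.dot_product_attention`:

```python
import jax
import jax.numpy as jnp

def fused_attention(q, k, v):
    # Shapes are (batch, seq, heads, head_dim). implementation="cudnn" requests the
    # fused flash-attention kernel on GPU; None/"xla" lets XLA decide (works everywhere).
    return jax.nn.dot_product_attention(q, k, v, is_causal=True, implementation="cudnn")

key = jax.random.PRNGKey(0)
q = k = v = jax.random.normal(key, (2, 128, 4, 64), dtype=jnp.bfloat16)
out = jax.jit(fused_attention)(q, k, v)
```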
-
If a document is open in a plank and a stack, attention jumps from the stack back to the standalone plank.
https://github.com/user-attachments/assets/5f81904d-c0ea-46a3-89db-dd715143fd53
-
I am using [this PyTorch-provided script](https://github.com/pytorch/pytorch/blob/main/benchmarks/transformer/score_mod.py) to benchmark flex attention against eager and got the attached results ([defaul…
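One thing worth ruling out when reading those numbers: flex attention only hits its fused kernel under `torch.compile`; run eagerly it falls back to an unfused reference path. A minimal sanity-check sketch with toy shapes (not the benchmark script's config):

```python
import torch
from torch.nn.attention.flex_attention import flex_attention

B, H, S, D = 4, 8, 1024, 64  # toy shapes, just placeholders
q, k, v = (torch.randn(B, H, S, D, device="cuda", dtype=torch.float16) for _ in range(3))

def noop_score_mod(score, b, h, q_idx, kv_idx):
    # Identity score_mod, i.e. plain attention expressed through flex attention.
    return score

# Compilation is what produces the fused kernel; eager flex attention runs an
# unfused reference path, which can explain large eager-vs-compiled gaps.
compiled_flex = torch.compile(flex_attention)
out = compiled_flex(q, k, v, score_mod=noop_score_mod)

# Baseline for comparison: the fused SDPA kernel.
ref = torch.nn.functional.scaled_dot_product_attention(q, k, v)
torch.testing.assert_close(out, ref, atol=2e-2, rtol=2e-2)
```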