-
### ⚠️ Please check that this feature request hasn't been suggested before.
- [X] I searched previous [Ideas in Discussions](https://github.com/axolotl-ai-cloud/axolotl/discussions/categories/ideas) …
-
When I read the code in your nice_stand.py file, I didn't see self-attention or graph attention mechanisms being used, yet you describe this part in your paper.
![Image 1](https://github.com/eeyhsong/NICE-…
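For reference, here is a minimal sketch of the kind of self-attention block I expected to find (a generic illustration; the module name and dimensions are placeholders, not taken from nice_stand.py):

```python
import torch
import torch.nn as nn

class SelfAttention(nn.Module):
    """Plain multi-head self-attention; a generic sketch, not NICE's code."""
    def __init__(self, dim: int, num_heads: int = 4):
        super().__init__()
        self.attn = nn.MultiheadAttention(dim, num_heads, batch_first=True)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, tokens, dim); queries, keys, and values all come from x
        out, _ = self.attn(x, x, x)
        return out
```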
-
Will there be 3090 support in Flash Attention 3 in the future?
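(As far as I know, FlashAttention 3 currently targets Hopper GPUs with compute capability 9.0, while the 3090 is Ampere at 8.6; a runtime gate like the sketch below is one way to fall back to FA2 in the meantime.)

```python
import torch

# Sketch: pick FA3 only on Hopper-class GPUs (compute capability >= 9.0);
# an RTX 3090 reports 8.6, so it would take the FA2 fallback path.
major, minor = torch.cuda.get_device_capability()
use_fa3 = (major, minor) >= (9, 0)
print(f"compute capability {major}.{minor} -> "
      f"{'FA3' if use_fa3 else 'FA2 fallback'}")
```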
-
I have a conda env where "FA2=True" and another env where "FA2=False" (as displayed in the terminal when running the fine-tuning script), yet the usable VRAM for tuning the same Gemma 2 model (2b or 9b) is the…
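To make the comparison between the two envs concrete, one option is PyTorch's allocator counters (this measures peak allocated memory for the process, not total GPU usage):

```python
import torch

torch.cuda.reset_peak_memory_stats()

# ... run one fine-tuning step of the same Gemma 2 model here ...

peak_gib = torch.cuda.max_memory_allocated() / 2**30
print(f"peak allocated VRAM: {peak_gib:.2f} GiB")
```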
-
Hello, I would like to ask: what are the attention_cuda.py and attention_native.py files in the classification folder, and are they modules? I would be very grateful if your team could answer.
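I don't know this repository's layout, but files paired this way are often two interchangeable backends for the same operator; a purely hypothetical sketch of that pattern (names and structure are illustrative, not from the classification folder):

```python
import torch
import torch.nn.functional as F

def attention_native(q, k, v):
    """Reference backend in plain PyTorch ops (what attention_native.py
    might hold); always available, no custom kernels."""
    scale = q.shape[-1] ** -0.5
    return F.softmax(q @ k.transpose(-2, -1) * scale, dim=-1) @ v

def attention(q, k, v):
    # In this hypothetical pattern, a compiled CUDA backend (attention_cuda.py)
    # would be tried first; this sketch only carries the native fallback.
    return attention_native(q, k, v)
```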
-
### Description
Perhaps I am using this function incorrectly, but I get data leaks when using `key_value_seq_lengths`. It appears as though both the `xla` and `cudnn` implementations in jax nightly…
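A minimal check I would expect to separate the two behaviors, assuming the documented (batch, seq, heads, head_dim) layout of jax.nn.dot_product_attention; a nonzero difference would mean keys past the stated lengths are leaking into the output:

```python
import jax
import jax.numpy as jnp

B, T, S, N, H = 2, 4, 8, 2, 16
q = jax.random.normal(jax.random.PRNGKey(0), (B, T, N, H))
k = jax.random.normal(jax.random.PRNGKey(1), (B, S, N, H))
v = jax.random.normal(jax.random.PRNGKey(2), (B, S, N, H))
lengths = jnp.array([3, 5])  # valid key/value length per batch element

out_len = jax.nn.dot_product_attention(
    q, k, v, key_value_seq_lengths=lengths, implementation="xla")

# Reference: the same restriction expressed as an explicit mask.
mask = (jnp.arange(S)[None, :] < lengths[:, None])[:, None, None, :]  # (B,1,1,S)
out_ref = jax.nn.dot_product_attention(q, k, v, mask=mask)

print(jnp.max(jnp.abs(out_len - out_ref)))  # should be ~0 if nothing leaks
```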
-
```
*** Error loading script: attention.py
Traceback (most recent call last):
  File "C:\Users\ZeroCool22\Desktop\webui_forge\webui\modules\scripts.py", line 525, in load_scripts
    s…
```
-
Where and how will you find the people who will download the app? And how will you get their attention in a generation with such a short attention span?
-
This issue is not in response to a performance regression.
The method of performing cross-attention QKV computations introduced in #4942 could be improved. Because this issue relates to cross-atten…
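For context, the general shape of the computation under discussion (a generic sketch, not the code from #4942): in cross-attention, K and V depend only on the encoder output, so they can be projected once and reused at every decoding step, while only Q is recomputed:

```python
import torch
import torch.nn.functional as F

d = 64
w_q, w_k, w_v = (torch.randn(d, d) for _ in range(3))
enc = torch.randn(10, d)                 # encoder output: 10 source positions

k_cache, v_cache = enc @ w_k, enc @ w_v  # projected once, reused every step

def cross_attn_step(dec_state):          # dec_state: (1, d), one decoder position
    q = dec_state @ w_q
    attn = F.softmax(q @ k_cache.T / d ** 0.5, dim=-1)
    return attn @ v_cache

out = cross_attn_step(torch.randn(1, d))
```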
-
The paper mentions grouping the nodes, which theoretically reduces the computational complexity. In the part of the code that computes spatial attention, where is this grouped computation reflected?
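(For what it's worth, the complexity argument itself looks like the sketch below: splitting N nodes into G groups replaces one O(N²) attention with G independent O((N/G)²) ones. This is a generic illustration, not this repository's code.)

```python
import torch
import torch.nn.functional as F

N, G, d = 64, 4, 32
x = torch.randn(N, d)

xg = x.view(G, N // G, d)                      # (G, N/G, d): nodes split into groups
scores = xg @ xg.transpose(-2, -1) / d ** 0.5  # (G, N/G, N/G): attention per group
out = (F.softmax(scores, dim=-1) @ xg).reshape(N, d)
```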