-
Great work, thanks for sharing!
In the paper it is said that the shape of the adjacency matrix is (n+1)×(n+1), which should be 22×22 (21 for joints and 1 for food contact). However, in your implementa…
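A minimal sketch of the node count being discussed (all names and edges here are hypothetical, not taken from the paper): n = 21 joint nodes plus 1 extra food-contact node gives an (n+1) × (n+1) = 22 × 22 adjacency matrix.

```python
import numpy as np

n_joints = 21
n_nodes = n_joints + 1          # 21 joints + 1 food-contact node = 22

# symmetric adjacency matrix for the (hypothetical) skeleton graph
A = np.zeros((n_nodes, n_nodes))

# illustrative edge: connect the contact node (index 21) to a made-up
# wrist joint at index 6 (the real joint indices depend on the dataset)
wrist = 6
A[n_joints, wrist] = A[wrist, n_joints] = 1

assert A.shape == (22, 22)
```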
-
Hi,
I encountered the following error message when trying to enable flash attention with the command below. Can I ask whether flash attention is supported?
Command: `./main -m $model -n 128 --prompt …`
-
Add attention head visualization in evaluation pipeline
-
Is there any way to add Flash Attention 2 support for this model? If there is, I would love to get involved and help out!
I've tried implementing it by looking at [MusicGen's one ](https://git…
-
https://github.com/ParadoxZW/LLaVA-UHD-Better/blob/main/llava_uhd/adapt_llava.py#L136-L138
Here, since the first token is for CLS, shouldn't we change
```python
m[:w * h] = True
```
to
```python
m[:w * h+1] = …
```
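A small illustration of the suspected off-by-one (shapes and values here are made up for the demo, not taken from the linked code): the ViT sequence is [CLS] followed by w*h patch tokens, so a mask that should cover CLS plus every patch needs w*h + 1 entries, while slicing only w*h entries leaves the last patch token out.

```python
# hypothetical patch grid; the real w, h come from the image resizing logic
w, h = 3, 2
seq_len = 1 + w * h        # CLS token + w*h patch tokens

m_old = [False] * seq_len
m_old[:w * h] = [True] * (w * h)            # covers CLS + first w*h - 1 patches

m_new = [False] * seq_len
m_new[:w * h + 1] = [True] * (w * h + 1)    # covers CLS + all w*h patches

assert m_old[-1] is False   # original slice misses the last patch token
assert all(m_new)           # proposed slice covers the whole sequence
```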
-
### System Info
L4 GPU (AWS g6.12xlarge) with TensorRT-LLM 0.11.0, running with Triton backends
### Who can help?
_No response_
### Information
- [ ] The official example scripts
- [ ] My own modified …
-
Maybe it is too niche, but we would be interested in a fused circular windowed attention.
Currently, we pad q, k, v, run the fused kernel, and crop.
It would either help if k, v could have different di…
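A naive numpy sketch of the pad-then-crop workaround described above (my own illustration, not the fused kernel in question): circularly padding k and v by half the window lets a plain sliding-window attention reproduce wrap-around (circular) neighbourhoods, which is what the padding step emulates before cropping.

```python
import numpy as np

def softmax(x):
    e = np.exp(x - x.max(axis=-1, keepdims=True))
    return e / e.sum(axis=-1, keepdims=True)

seq, dim, window = 8, 4, 3
halo = window // 2
rng = np.random.default_rng(0)
q = rng.standard_normal((seq, dim))
k = rng.standard_normal((seq, dim))
v = rng.standard_normal((seq, dim))

# reference: circular windowed attention via modular indexing
out_direct = np.empty_like(q)
for i in range(seq):
    idx = [(i + d) % seq for d in range(-halo, halo + 1)]
    s = softmax(q[i] @ k[idx].T)
    out_direct[i] = s @ v[idx]

# workaround: circularly pad k, v, then run a plain sliding window
pad = lambda x: np.concatenate([x[-halo:], x, x[:halo]], axis=0)
k_pad, v_pad = pad(k), pad(v)
out_pad = np.empty_like(q)
for i in range(seq):
    s = softmax(q[i] @ k_pad[i:i + window].T)
    out_pad[i] = s @ v_pad[i:i + window]

assert np.allclose(out_direct, out_pad)
```

A real fused kernel would also pad q and crop the padded outputs, as the issue describes; the loop here just keeps the equivalence easy to check.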
-
Hey @Luke-Luo1,
Thank you for your great work! I noticed that the DS-attention used in the dance decoder only enhances the attention scores of the upper-body joints. I just wonder why not do the DS-att…
-
### Description of the bug
Not really a bug, but the scrolling in the layer view used to be a lot better.
I am not sure if this issue is particular to different mouse models, but it used to be that fo…
-
Hi, thank you for this interesting work :)
I was wondering how the "attention heatmap" in the paper was drawn.
If I have understood your method correctly, the learnable parameters are only added to …