-
There are a couple of different libraries I investigated; I am choosing:
- [x] [BertViz](https://github.com/jessevig/bertviz/tree/master): shows the attentions between `query` from search and a SERP
Oth…
-
### 🐛 Describe the bug
With a 2D spatial neighborhood pattern, flex attention is orders of magnitude slower than dense attention:
```
hlc=2
seq_length : 192
flex attention : 0.0015106382369995117 […
```
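For context, a 2D spatial neighborhood like this is typically expressed as a mask predicate in the shape flex attention's `mask_mod` expects. A minimal pure-Python sketch of that predicate, assuming a hypothetical grid width `W` and using `hlc` as the half-window radius from the numbers above:

```python
# Sketch of a 2D spatial-neighborhood mask predicate, in the shape
# flex attention's mask_mod expects: (b, h, q_idx, kv_idx) -> bool.
# W (grid width) is a toy assumption; HLC mirrors hlc=2 above.
W = 16        # tokens per row of the 2D grid (assumption)
HLC = 2       # half-window: attend within +/-2 cells along each axis

def neighborhood_mask(b, h, q_idx, kv_idx):
    """True iff the kv cell lies within a (2*HLC+1)^2 window of the q cell."""
    qy, qx = q_idx // W, q_idx % W
    ky, kx = kv_idx // W, kv_idx % W
    return abs(qy - ky) <= HLC and abs(qx - kx) <= HLC
```

With torch, a predicate like this would be turned into a block mask via `create_block_mask` and passed to `flex_attention`; the sparsity it induces is what the timings above compare against dense attention.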
-
While using the right hidden size for the rotation, the Llama 3 8B model performs better:
WIKITEXT2 PPL improves from 11.544 to 8.967.
But for the other models, while running the fake quant:
`pyt…
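For context, rotation-based quantization schemes of this kind rely on an orthogonal rotation `R` folded into the weights being exactly undone on the activations, which only works when `R` is square with the model's hidden size. A minimal numpy sketch with toy dimensions (not the actual model sizes):

```python
import numpy as np

# Toy demonstration that an orthogonal rotation cancels between
# weights and activations: W @ x == (W @ R) @ (R.T @ x), provided
# R matches the hidden size. All shapes here are toy assumptions.
rng = np.random.default_rng(0)
hidden = 8                               # toy hidden size (assumption)
W = rng.standard_normal((4, hidden))     # toy weight matrix
x = rng.standard_normal(hidden)          # toy activation vector

# Random orthogonal matrix via QR decomposition.
R, _ = np.linalg.qr(rng.standard_normal((hidden, hidden)))

y_ref = W @ x
y_rot = (W @ R) @ (R.T @ x)              # rotated weights, counter-rotated input
print(np.allclose(y_ref, y_rot))         # the rotation cancels exactly
```

A mismatched hidden size breaks this cancellation outright, which is one place to look before blaming the fake quant itself.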
-
### Describe the issue
I want to ask a general question. When analyzing attention scores, I notice that my attention scores are quite sparse and their values are also very low. I cannot obtain any valuable…
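One clarifying point for questions like this: attention weights are a softmax over the key axis, so with `N` keys each row sums to 1 and the average weight is `1/N`; individually tiny values are expected at long sequence lengths and are not by themselves a problem. Per-row entropy is often a more informative sparsity measure. A minimal numpy sketch with toy scores (not the asker's model):

```python
import numpy as np

# Toy pre-softmax scores; N is a hypothetical sequence length.
rng = np.random.default_rng(0)
N = 256
scores = rng.standard_normal((N, N))

# Numerically stable softmax over the key axis: rows sum to 1.
attn = np.exp(scores - scores.max(axis=-1, keepdims=True))
attn /= attn.sum(axis=-1, keepdims=True)

mean_weight = attn.mean()                # == 1/N by construction
entropy = -(attn * np.log(attn + 1e-12)).sum(axis=-1)  # per-query entropy
print(mean_weight, entropy.mean())       # low mean is normal; low entropy = peaked rows
```

Uniform rows have entropy `log(N)`; rows that concentrate on a few keys have much lower entropy, which is the signal to inspect.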
-
The last thing in the old repo was to redirect issues here, so I'm not sure if this is a duplicate of something there, but I loved the extension in a1111 and hope to have it again with forge...
This is what I g…
-
### What happened?
I would like to begin by expressing my sincere gratitude to the authors for their dedication and effort in developing this work.
To provide context for the issue I am encounter…
-
### Feature request
I am working with a flash linear attention (FLA) model and aiming to process data in a frame-by-frame manner with [batch, frame, feature] input format. Additionally, I am looking …
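For context, frame-by-frame processing is exactly what the recurrent view of (causal, unnormalized) linear attention enables: instead of materializing the full attention matrix, you carry a state `S` and update it one frame at a time. A minimal numpy sketch showing the two forms agree; shapes are toy assumptions and this is not FLA's actual API:

```python
import numpy as np

# Toy frames and feature dimension (assumptions).
rng = np.random.default_rng(0)
T, D = 6, 4
q = rng.standard_normal((T, D))
k = rng.standard_normal((T, D))
v = rng.standard_normal((T, D))

# Parallel form: causal (lower-triangular) mask on q @ k.T.
mask = np.tril(np.ones((T, T)))
out_parallel = (mask * (q @ k.T)) @ v

# Recurrent form: one frame at a time; state S accumulates k_t v_t^T,
# so out_t = q_t @ S_t = sum_{s<=t} (q_t . k_s) v_s.
S = np.zeros((D, D))
out_recurrent = np.empty_like(v)
for t in range(T):
    S += np.outer(k[t], v[t])
    out_recurrent[t] = q[t] @ S

print(np.allclose(out_parallel, out_recurrent))  # identical outputs
```

The recurrent loop consumes input of shape [frame, feature] one step at a time with O(D^2) state, which is what makes streaming, frame-by-frame inference possible.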
-
For reproduction:
Input model:
https://sharkpublic.blob.core.windows.net/sharkpublic/sai/sdxl-punet/punet.mlir
Input data:
wget https://sharkpublic.blob.core.windows.net/sharkpublic/sai/sdx…
-
### Have I written custom code (as opposed to using a stock example script provided in MediaPipe)
Yes
### OS Platform and Distribution
Android 14
### Mobile device if the issue happens on …
-
Hello lyuwenyu,
First of all, thank you for your amazing work on RT-DETR! I’ve just started learning about object detection models, and I truly appreciate the innovations that make RT-DETR both fas…