-
### Search before asking
- [X] I have searched the Ultralytics YOLO [issues](https://github.com/ultralytics/ultralytics/issues) and [discussions](https://github.com/ultralytics/ultralytics/discussion…
-
Is there any plan to add attention masking support? PyTorch's version of flash attention v1 included the ability to provide an attention mask in its [implementation](https://pytorch.org/docs/stable/genera…
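For reference, a minimal sketch of the masking support referenced above: PyTorch's `scaled_dot_product_attention` accepts an `attn_mask` argument (the shapes and the causal mask below are illustrative, not tied to any particular model):

```python
import torch
import torch.nn.functional as F

q = torch.randn(1, 8, 16, 64)                             # (batch, heads, seq, head_dim)
k = torch.randn(1, 8, 16, 64)
v = torch.randn(1, 8, 16, 64)
attn_mask = torch.ones(16, 16, dtype=torch.bool).tril()   # True = allowed to attend
out = F.scaled_dot_product_attention(q, k, v, attn_mask=attn_mask)
print(out.shape)                                          # torch.Size([1, 8, 16, 64])
```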
-
### 🐛 Describe the bug
Encountered while testing for preview release/2.6 builds as part of https://github.com/pytorch/pytorch/issues/139175
```
python test_flex_attention.py -k "test_load_from_bias…
```
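For context, a hedged sketch of what a "load from bias" FlexAttention pattern typically looks like (this is not the failing test itself; the shapes, device, and `score_mod` below are assumptions):

```python
import torch
from torch.nn.attention.flex_attention import flex_attention

B, H, S, D = 2, 4, 128, 64
q = torch.randn(B, H, S, D, device="cuda")
k = torch.randn(B, H, S, D, device="cuda")
v = torch.randn(B, H, S, D, device="cuda")
bias = torch.randn(S, S, device="cuda")        # per-position additive bias

def score_mod(score, b, h, q_idx, kv_idx):
    # Add a bias looked up by (query position, key position).
    return score + bias[q_idx, kv_idx]

out = flex_attention(q, k, v, score_mod=score_mod)
```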
-
python main.py --function test --config configs/cub_stage2.yml --opt "{'test': {'load_token_path': 'ckpts/cub983/tokens/', 'load_unet_path': 'ckpts/cub983/unet/', 'save_log_path': 'ckpts/cub983/log.tx…
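As a hedged sketch of how such an `--opt` override might be applied (the `deep_update` helper is illustrative, not necessarily the repository's actual mechanism; the truncated `save_log_path` entry is omitted):

```python
import ast
import yaml

def deep_update(base: dict, override: dict) -> dict:
    # Recursively merge override values into the loaded config (illustrative helper).
    for key, value in override.items():
        if isinstance(value, dict) and isinstance(base.get(key), dict):
            deep_update(base[key], value)
        else:
            base[key] = value
    return base

with open("configs/cub_stage2.yml") as f:
    config = yaml.safe_load(f)

opt = "{'test': {'load_token_path': 'ckpts/cub983/tokens/', 'load_unet_path': 'ckpts/cub983/unet/'}}"
config = deep_update(config, ast.literal_eval(opt))
```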
-
Description: When running inference on the distilbert-base-uncased model using the NPU on Snapdragon® X Elite (X1E78100 - Qualcomm®) through ONNX Runtime's QNNExecutionProvider, the model fails to inf…
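For reference, a minimal sketch of how a session with the QNNExecutionProvider is typically created (the model path, `backend_path` value, input names, and shapes are assumptions, not taken from this report):

```python
import numpy as np
import onnxruntime as ort

# QNN EP targeting the HTP (NPU) backend, with CPU as a fallback.
session = ort.InferenceSession(
    "distilbert-base-uncased.onnx",
    providers=[
        ("QNNExecutionProvider", {"backend_path": "QnnHtp.dll"}),
        "CPUExecutionProvider",
    ],
)

input_ids = np.ones((1, 128), dtype=np.int64)        # dummy tokenized input
attention_mask = np.ones((1, 128), dtype=np.int64)
outputs = session.run(None, {"input_ids": input_ids, "attention_mask": attention_mask})
print(outputs[0].shape)
```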
-
I installed flash-attention following this link: https://rocm.blogs.amd.com/artificial-intelligence/flash-attention/README.html
My GPU is gfx1100 (7900 XTX).
I installed it in the Docker container and the d…
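A minimal smoke test for such an install, assuming the ROCm build exposes the same `flash_attn_func` API as the upstream package (shapes are illustrative):

```python
import torch
from flash_attn import flash_attn_func

# q, k, v: (batch, seqlen, nheads, head_dim), fp16/bf16 on the GPU.
q = torch.randn(1, 128, 8, 64, device="cuda", dtype=torch.float16)
k = torch.randn_like(q)
v = torch.randn_like(q)
out = flash_attn_func(q, k, v, causal=True)
print(out.shape)                                 # torch.Size([1, 128, 8, 64])
```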
-
Hello, when I was building attention heatmaps, I found that the attention scores across different patches did not vary much. Have you encountered this problem before?
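One way to quantify "did not vary much" is to plot the CLS-to-patch attention and check its spread; a minimal sketch, assuming ViT-style attention weights of shape (batch, heads, tokens, tokens) with a CLS token (the tensor below is a placeholder, not the model in question):

```python
import torch
import matplotlib.pyplot as plt

# Placeholder weights: (batch, heads, tokens, tokens), tokens = 1 CLS + 14*14 patches.
attn = torch.rand(1, 12, 197, 197).softmax(dim=-1)
cls_to_patches = attn[0].mean(dim=0)[0, 1:]      # average heads, CLS row, drop CLS column
print("spread:", (cls_to_patches.max() - cls_to_patches.min()).item())

plt.imshow(cls_to_patches.reshape(14, 14).numpy(), cmap="viridis")
plt.colorbar()
plt.show()
```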
-
The following edits were required to make llama3 8b fp16 work:
```
config["attn_head_count"] = 8 # 8 instead of 32
config["paged_kv_cache"] = {}
config["paged_kv_cache"]["block_seq_stride"] = conf…
-
Passing the --use-flash-attn flag is intended to enable flash attention; however, when the --use-mcore-models flag (to use the transformer engine) is also specified, flash attention will not be applie…
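A generic way to check whether flash-attention kernels can actually run for given shapes and dtypes is plain PyTorch SDPA with the backend pinned (this is not the Megatron-LM/Transformer Engine code path, just a sanity check):

```python
import torch
import torch.nn.functional as F
from torch.nn.attention import SDPBackend, sdpa_kernel

q = torch.randn(2, 8, 512, 64, device="cuda", dtype=torch.float16)
k, v = torch.randn_like(q), torch.randn_like(q)

# Restricting SDPA to the flash-attention backend raises "No available kernel"
# if flash attention cannot be used for these inputs.
with sdpa_kernel(SDPBackend.FLASH_ATTENTION):
    out = F.scaled_dot_product_attention(q, k, v, is_causal=True)
print(out.shape)
```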
-
Hi, in the Causal Cross Attention, I see we register a causal mask as a lower-triangular matrix.
However, when we are trying to learn the latent parameters C of seq_len m such that m < …
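One common pattern, sketched under the assumption that the m latent queries attend over n keys and the registered (n, n) lower-triangular buffer is sliced to the block actually needed (names and shapes below are illustrative, not from the repository under discussion):

```python
import torch

n, m, d = 16, 4, 32
mask = torch.tril(torch.ones(n, n, dtype=torch.bool))   # registered (n, n) causal buffer

q = torch.randn(1, m, d)   # latent queries C (seq_len m)
k = torch.randn(1, n, d)
v = torch.randn(1, n, d)

scores = q @ k.transpose(-2, -1) / d ** 0.5             # (1, m, n)
scores = scores.masked_fill(~mask[:m, :n], float("-inf"))
out = scores.softmax(dim=-1) @ v                        # (1, m, d)
print(out.shape)
```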