-
Hey, I made a small change in generate_v2.py to run a loop over the whole test set. I am getting an error, I guess because of caching. I have pasted my code and the error message I am getting below…
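The diff and the actual traceback are cut off above, so purely as an illustration of the pattern being described, here is a minimal sketch of looping generation over a test set without carrying any cached key/value state across examples; `model`, `tokenizer`, and `test_set` are hypothetical stand-ins, not the real generate_v2.py objects.
```python
import torch

@torch.no_grad()
def run_on_test_set(model, tokenizer, test_set, max_new_tokens=64):
    # `model`, `tokenizer`, and `test_set` are placeholders for whatever
    # generate_v2.py actually uses; this only shows the per-example loop.
    outputs = []
    for example in test_set:
        inputs = tokenizer(example["prompt"], return_tensors="pt").to(model.device)
        # generate() builds a fresh KV cache for each call; avoid reusing
        # past_key_values or other cached state from a previous example.
        generated = model.generate(**inputs, max_new_tokens=max_new_tokens, use_cache=True)
        outputs.append(tokenizer.decode(generated[0], skip_special_tokens=True))
    return outputs
```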
-
I am trying to reproduce the Baseline CLIP results for the Single-object GQA setting, but I am getting a much lower mAP of 0.18, which does not match the paper's numbers. I am using the pooled output of CLIP'…
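For reference, the pooled CLIP image embedding can be pulled out with the plain transformers API roughly as below; the checkpoint name and image path are assumptions, and the sketch is only meant to show the difference between the unprojected pooled output and the projected image features, which is a common source of mAP gaps when reproducing CLIP baselines.
```python
import torch
from PIL import Image
from transformers import CLIPModel, CLIPProcessor

model_name = "openai/clip-vit-base-patch32"  # assumed checkpoint; use whatever the paper specifies
model = CLIPModel.from_pretrained(model_name).eval()
processor = CLIPProcessor.from_pretrained(model_name)

image = Image.open("example.jpg")  # placeholder image path
inputs = processor(images=image, return_tensors="pt")

with torch.no_grad():
    vision_out = model.vision_model(**inputs)
    pooled = vision_out.pooler_output               # CLS token after post-layernorm, no projection
    projected = model.get_image_features(**inputs)  # pooled output passed through visual_projection
```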
-
With the new release of version 3.2.0, using ONNX has become much easier, but initial local tests led to various errors, meaning that it was not possible to use ONNX Runtime via Sentence Transform…
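For context, the ONNX backend usage introduced in 3.2.0 looks roughly like the snippet below, assuming the ONNX extras are installed (`pip install "sentence-transformers[onnx]"`); the model name is only an example.
```python
from sentence_transformers import SentenceTransformer

# The model name is just an example; the backend argument is what selects
# ONNX Runtime instead of the default PyTorch backend.
model = SentenceTransformer("sentence-transformers/all-MiniLM-L6-v2", backend="onnx")

embeddings = model.encode(["This sentence runs through ONNX Runtime."])
print(embeddings.shape)
```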
-
## Summary
For the full Llama 3B model bringup, we want to test the main standalone blocks before running full model e2e. One of those blocks is the attention module.
## Details
For initial Llama…
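Not the actual bringup harness, but a minimal sketch of what checking an attention block standalone can look like before going e2e: compute a plain-PyTorch reference with scaled dot-product attention and compare it against the block under test. The dimensions below are placeholders, not the real Llama 3B config, and the explicit softmax path stands in for the device-under-test output.
```python
import torch
import torch.nn.functional as F

# Placeholder dimensions, not the real Llama 3B config.
batch, seq_len, n_heads, head_dim = 1, 128, 8, 64

q = torch.randn(batch, n_heads, seq_len, head_dim)
k = torch.randn(batch, n_heads, seq_len, head_dim)
v = torch.randn(batch, n_heads, seq_len, head_dim)

# Host reference: causal scaled dot-product attention.
ref = F.scaled_dot_product_attention(q, k, v, is_causal=True)

# The block under test would produce `out`; here the same math is recomputed
# explicitly just to show the comparison pattern used for block-level checks.
scale = head_dim ** -0.5
scores = (q @ k.transpose(-2, -1)) * scale
causal_mask = torch.triu(torch.ones(seq_len, seq_len, dtype=torch.bool), diagonal=1)
scores = scores.masked_fill(causal_mask, float("-inf"))
out = torch.softmax(scores, dim=-1) @ v

torch.testing.assert_close(out, ref, rtol=1e-4, atol=1e-4)
```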
-
I would like to reproduce this with T5, but after swapping in the model I get an error about missing decoder_input_ids, as shown below:
```
Traceback (most recent call last):
File "/mnt/bn/songhengrui-nas/Scented-EAE/main.py", line 85, in
main()
File "/mnt/bn/songhengr…
-
Hi, I set `F.scaled_dot_product_attention = sageattn` in modeling_llama.py and ran the inference code,
and I see it runs `sageattn_qk_int8_pv_fp16_cuda` in `sageattention/core.py`.
The results are:
…
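For anyone reproducing this, the monkey-patch being described is roughly the following (only the patch itself; the surrounding inference script is omitted). It assumes the sageattention package is installed with its CUDA kernels built.
```python
import torch.nn.functional as F
from sageattention import sageattn  # assumes sageattention is installed with CUDA support

# Replace PyTorch's SDPA globally so that modeling_llama.py, which calls
# F.scaled_dot_product_attention, routes through SageAttention's INT8-QK /
# FP16-PV kernels instead. The patch must run before the model's forward pass.
F.scaled_dot_product_attention = sageattn

# ... load the Llama model and run inference as usual after this point ...
```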
-
(allegro) D:\PyShit\Allegro>python single_inference.py ^
More? --user_prompt "A seaside harbor with bright sunlight and sparkling seawater, with many boats in the water. From an aerial view, the boats…
-
### OpenVINO Version
2024.3
### Operating System
Ubuntu 20.04 (LTS)
### Device used for inference
NPU
### Framework
PyTorch
### Model used
torch.nn.MultiheadAttention
### Issue description
…
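The description is cut off above, so purely for context, a minimal reproduction of what the template fields describe (a PyTorch `torch.nn.MultiheadAttention` converted with OpenVINO 2024.3 and compiled for NPU) might look like the sketch below; the wrapper module, shapes, and dimensions are assumptions, not the reporter's actual model.
```python
import torch
import openvino as ov


class MHAWrapper(torch.nn.Module):
    """Thin wrapper so convert_model traces a single-input self-attention forward."""

    def __init__(self, embed_dim=256, num_heads=8):
        super().__init__()
        self.mha = torch.nn.MultiheadAttention(embed_dim, num_heads, batch_first=True)

    def forward(self, x):
        out, _ = self.mha(x, x, x, need_weights=False)
        return out


example = torch.randn(1, 32, 256)
ov_model = ov.convert_model(MHAWrapper().eval(), example_input=example)

core = ov.Core()
compiled = core.compile_model(ov_model, "NPU")  # "CPU" can serve as a reference for comparison
result = compiled([example.numpy()])[0]
print(result.shape)
```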
-
I have a 3060 Ti with 8 GB of VRAM.
When I run:
Loading personal and system profiles took 953ms.
(base) PS C:\Windows\system32> e:
(base) PS E:\> cd MagicQuill
(base) PS E:\MagicQuill> conda activate…
-
This issue is not in response to a performance regression.
The method of performing cross-attention QKV computations introduced in #4942 could be improved. Because this issue relates to cross-atten…
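As background for the discussion, and not the actual implementation from #4942: cross-attention differs from self-attention in that only the queries come from the decoder hidden states, while keys and values are projected from the encoder output, so a single fused QKV projection over one input does not apply directly; at most K and V can be fused, because they share the encoder output. A generic sketch of that split:
```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class CrossAttention(nn.Module):
    """Generic cross-attention: Q from decoder states, K/V from encoder output."""

    def __init__(self, hidden_size: int, num_heads: int):
        super().__init__()
        self.num_heads = num_heads
        self.head_dim = hidden_size // num_heads
        self.q_proj = nn.Linear(hidden_size, hidden_size)
        # K and V share the same input (the encoder output), so they can be
        # fused into one matmul, but Q cannot be fused with them.
        self.kv_proj = nn.Linear(hidden_size, 2 * hidden_size)
        self.o_proj = nn.Linear(hidden_size, hidden_size)

    def forward(self, decoder_states, encoder_out):
        b, tq, h = decoder_states.shape
        tk = encoder_out.shape[1]
        q = self.q_proj(decoder_states).view(b, tq, self.num_heads, self.head_dim).transpose(1, 2)
        k, v = self.kv_proj(encoder_out).chunk(2, dim=-1)
        k = k.view(b, tk, self.num_heads, self.head_dim).transpose(1, 2)
        v = v.view(b, tk, self.num_heads, self.head_dim).transpose(1, 2)
        out = F.scaled_dot_product_attention(q, k, v)
        return self.o_proj(out.transpose(1, 2).reshape(b, tq, h))
```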