-
Looking at the code, it seems there are no learned weights for the key, query, and value when implementing self-attention. Is this the correct implementation?
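For reference, a minimal sketch of the standard formulation, in which key, query, and value come from learned linear projections (the class name and dimensions here are illustrative, not taken from the repository):

```python
import torch
from torch import nn

class SelfAttention(nn.Module):
    """Single-head self-attention with learned Q/K/V projections (illustrative)."""
    def __init__(self, dim: int):
        super().__init__()
        # Learned projection weights for query, key, and value.
        self.to_q = nn.Linear(dim, dim, bias=False)
        self.to_k = nn.Linear(dim, dim, bias=False)
        self.to_v = nn.Linear(dim, dim, bias=False)
        self.scale = dim ** -0.5

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, sequence, dim)
        q, k, v = self.to_q(x), self.to_k(x), self.to_v(x)
        attn = (q @ k.transpose(-2, -1)) * self.scale
        attn = attn.softmax(dim=-1)
        return attn @ v
```

If the code in question computes attention directly from the input without such projections, that is the non-standard case the question is asking about.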
-
Hi, I wonder how these figures were obtained.
Running SVD on the self-attention map would produce U, S, and V.T.
How did you obtain the figures?
![capture](https://github.com/google/prompt-to-prompt/asset…
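For context, a hedged sketch of what taking an SVD of an attention map might look like; the variable names and the (heads, tokens, tokens) shape are assumptions, not taken from the repository:

```python
import torch

# attn_map: an attention map of shape (heads, tokens, tokens), rows summing to 1.
attn_map = torch.rand(8, 256, 256).softmax(dim=-1)

# torch.linalg.svd returns U, S, Vh (Vh is V transposed), batched over the head dimension.
U, S, Vh = torch.linalg.svd(attn_map)

print(U.shape, S.shape, Vh.shape)  # (8, 256, 256), (8, 256), (8, 256, 256)
```

The open question is how U, S, and Vh were then turned into the visualizations shown in the figure.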
-
### 🐛 Describe the bug
```python
import torch
from torch import nn, Tensor
from torch.export import export_for_inference, Dim
from torch.nn.attention.flex_attention import flex_attention

class…
```
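Since the snippet above is cut off, here is a hedged, self-contained sketch of the kind of setup being described: a small module wrapping `flex_attention`, exported with the standard `torch.export.export` entry point and a dynamic sequence dimension. The module definition and shapes are illustrative assumptions, not the original reproducer, and depending on the PyTorch version this may export cleanly or hit the reported bug.

```python
import torch
from torch import nn, Tensor
from torch.export import export, Dim
from torch.nn.attention.flex_attention import flex_attention

class FlexAttentionBlock(nn.Module):
    """Illustrative module: plain flex_attention over (batch, heads, seq, head_dim) inputs."""
    def forward(self, q: Tensor, k: Tensor, v: Tensor) -> Tensor:
        return flex_attention(q, k, v)

q = k = v = torch.randn(1, 4, 128, 64)

# Mark the sequence dimension (index 2) as dynamic for all three inputs.
seq = Dim("seq", min=16, max=1024)
dynamic_shapes = {"q": {2: seq}, "k": {2: seq}, "v": {2: seq}}

ep = export(FlexAttentionBlock(), (q, k, v), dynamic_shapes=dynamic_shapes)
print(ep)
```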
-
Hi,
I'm trying to run inference on an AWQ-quantized model and I'm constantly getting this error when trying to generate text.
I'm using Qwen2.5-72B-Instruct-AWQ.
Some code to give context:
sel…
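Since the snippet is truncated, here is a hedged sketch of loading an AWQ checkpoint with transformers for comparison; the model ID is from the post, but everything else (device map, prompt, generation settings) is an assumption rather than the original code:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "Qwen/Qwen2.5-72B-Instruct-AWQ"

tokenizer = AutoTokenizer.from_pretrained(model_id)
# AWQ checkpoints carry their quantization config, so no extra quantization args are needed.
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.float16,
    device_map="auto",
)

inputs = tokenizer("Hello, how are you?", return_tensors="pt").to(model.device)
output_ids = model.generate(**inputs, max_new_tokens=32)
print(tokenizer.decode(output_ids[0], skip_special_tokens=True))
```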
-
Is there any possibility to fix this, maybe in some version of vLLM?
(LLMRayActor pid=1005) WARNING 11-15 15:19:28 gemma2.py:351] Some weights are not initialized from checkpoints: {'layers.18.mlp.gate_up_proj.…
-
Hi,
First, thank you very much for your work. It adds a huge improvement to the DETR family.
Your paper was really well explained and written.
Also, thank you for publishing your code & models, i…
-
Hi, I tried a test compiling the UNet (torch.float16), which is part of StableDiffusionXLPipeline, on an Inferentia2.8xlarge, and it failed.
When the latent size of the UNet is (64, 64), it did not fai…
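For reference, a minimal sketch of how compiling a module for Inferentia2 with `torch_neuronx.trace` typically looks; the toy module and shapes below are placeholders, not the SDXL UNet from the report:

```python
import torch
import torch_neuronx
from torch import nn

class TinyBlock(nn.Module):
    """Toy placeholder; the report compiles the SDXL UNet cast to torch.float16."""
    def __init__(self):
        super().__init__()
        self.conv = nn.Conv2d(4, 4, kernel_size=3, padding=1)

    def forward(self, latents: torch.Tensor) -> torch.Tensor:
        return self.conv(latents)

model = TinyBlock().eval()
# Example latent with spatial size (64, 64), matching the case described above.
example_latents = torch.randn(1, 4, 64, 64)

# torch_neuronx.trace compiles the module ahead of time for Neuron (Inferentia2) devices.
traced = torch_neuronx.trace(model, example_latents)
```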
-
```
Traceback (most recent call last):
  File "/home/yy/MSSR-main/MSSR-main/run_model.py", line 39, in
    run_result = run_recbole(model=args.model, dataset=args.dataset, config_file_list=config_file_…
```
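For context, `run_recbole` is RecBole's quick-start entry point; a hedged sketch of the call at the point where the traceback starts, with placeholder model, dataset, and config file names (the real script passes `args.model`, `args.dataset`, and a config file list):

```python
from recbole.quick_start import run_recbole

# Placeholder arguments for illustration only.
run_result = run_recbole(
    model="BPR",
    dataset="ml-100k",
    config_file_list=["config.yaml"],
)
print(run_result)  # dict with validation and test metrics
```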
-
Just realized I get the below warning with Salesforce/blip-image-captioning-large; I think I already ran results for it, but they're probably random in that case. Maybe someone could check the result…
-
Thank you for developing this!
## Context
Due to lengthy computation time, and in order to speed things up, I thought about using `flash_attention_2` and a smaller floating-point type, `torch.float16`…
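A minimal sketch of what that combination typically looks like, assuming the model is loaded through transformers `from_pretrained`; the model ID is a placeholder, and the `flash-attn` package plus a CUDA device are required:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "meta-llama/Llama-3.1-8B-Instruct"  # placeholder, not from the original post

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.float16,                # smaller floating point to speed things up
    attn_implementation="flash_attention_2",  # requires the flash-attn package and a GPU
    device_map="auto",
)
```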