-
### Feature request
The current implementation of the LLaMA model in the Hugging Face Transformers repository supports self-attention layers, as per the standard design of transformer models. I prop…
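For illustration, here is a minimal sketch of what such a cross-attention sub-layer could look like in plain PyTorch. The class name and wiring are hypothetical, not part of Transformers: queries come from the decoder hidden states, while keys/values come from an external encoder sequence.

```python
import torch
import torch.nn as nn

class CrossAttention(nn.Module):
    """Hypothetical cross-attention sketch (not the Transformers API):
    queries from decoder hidden states, keys/values from encoder states."""

    def __init__(self, dim: int, num_heads: int):
        super().__init__()
        self.attn = nn.MultiheadAttention(dim, num_heads, batch_first=True)

    def forward(self, hidden_states, encoder_states):
        # query = hidden_states, key = value = encoder_states
        out, _ = self.attn(hidden_states, encoder_states, encoder_states)
        return out

x = torch.randn(2, 16, 512)    # decoder states: (batch, tgt_len, dim)
ctx = torch.randn(2, 77, 512)  # encoder states: (batch, src_len, dim)
print(CrossAttention(512, 8)(x, ctx).shape)  # torch.Size([2, 16, 512])
```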
-
Cross-Layer Attention (CLA), recently proposed by MIT, can significantly reduce runtime memory usage by sharing KV activations across layers. Does vLLM have any plans to support it? Thanks!
Cross-Layer Attention paper: https://arxiv.or…
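For context, the core idea of the paper is that some layers reuse the key/value activations computed by an earlier layer instead of projecting their own, so those layers contribute nothing to the KV cache. A rough sketch under that reading (all names hypothetical, not vLLM code):

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class CLABlock(nn.Module):
    """Rough sketch of Cross-Layer Attention: a layer either computes
    fresh K/V or reuses the K/V produced by a preceding layer."""

    def __init__(self, dim: int, num_heads: int, has_kv: bool):
        super().__init__()
        self.num_heads, self.head_dim = num_heads, dim // num_heads
        self.q_proj = nn.Linear(dim, dim)
        self.has_kv = has_kv
        if has_kv:
            self.kv_proj = nn.Linear(dim, 2 * dim)
        self.out_proj = nn.Linear(dim, dim)

    def forward(self, x, shared_kv=None):
        b, t, d = x.shape
        q = self.q_proj(x).view(b, t, self.num_heads, self.head_dim).transpose(1, 2)
        if self.has_kv:
            k, v = self.kv_proj(x).chunk(2, dim=-1)
            k = k.view(b, t, self.num_heads, self.head_dim).transpose(1, 2)
            v = v.view(b, t, self.num_heads, self.head_dim).transpose(1, 2)
            shared_kv = (k, v)  # computed once, reused by later layers
        else:
            k, v = shared_kv    # reuse: no new KV-cache entries here
        out = F.scaled_dot_product_attention(q, k, v, is_causal=True)
        out = out.transpose(1, 2).reshape(b, t, d)
        return self.out_proj(out), shared_kv

# Alternating has_kv True/False (CLA-2 in the paper) roughly halves KV-cache memory.
layers = [CLABlock(512, 8, has_kv=(i % 2 == 0)) for i in range(4)]
x, kv = torch.randn(2, 16, 512), None
for layer in layers:
    x, kv = layer(x, kv)
```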
-
I am interested in performing multimodal cross-attention. I don't see any issues performing self-attention in the encoder, since I can use the `BertAttention` plugin. However, cross-attention would have `qu…
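As a plain-PyTorch illustration (not a TensorRT plugin) of the asymmetry cross-attention introduces: the query and key/value sequence lengths can differ across modalities, and the output keeps the query length.

```python
import torch
import torch.nn.functional as F

# Toy shapes: 8 heads, head_dim 64; text queries attend over image patches.
q = torch.randn(2, 8, 32, 64)   # (batch, heads, q_len=32, head_dim) - text side
k = torch.randn(2, 8, 196, 64)  # (batch, heads, kv_len=196, head_dim) - image side
v = torch.randn(2, 8, 196, 64)

# Unlike self-attention, q_len and kv_len differ; output keeps q_len.
out = F.scaled_dot_product_attention(q, k, v)
print(out.shape)  # torch.Size([2, 8, 32, 64])
```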
-
When I run "python src/eval.py --dataset_path --batch_size --mixed_precision fp16 --output_dir --save_name --num_workers_test --sketch_cond_rate 0.2 --dataset --start_cond_rate 0.0 --test_order …
-
For the same training settings on both trainers:
512x512
AdamW8bit
batch size 4
FluxGym: 1.19s/it
ComfyUI FluxTrainer: 7.31s/it
That's about 6 times faster on FluxGym; on FluxTrainer my GPU utilisa…
-
Hi, @lucidrains. Thank you for sharing this excellent implementation with us all!
Do you have any thoughts as to what changes would need to be made to make cross-attention possible with your `FLASH` …
-
Hi @fradif96
We don't have separate modules for cross- vs. self-attention. It's the same attention layers; we just reshape the latent features from ((b t) l d) -> (b (t l) d) [here](https://gith…
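In einops terms, the reshape looks roughly like this (a sketch with made-up dimensions, not the repo's exact code). Merging the time axis into the token axis lets the unchanged self-attention layer attend across all frames at once:

```python
import torch
from einops import rearrange

b, t, l, d = 2, 8, 64, 320    # batch, frames, tokens per frame, dim
x = torch.randn(b * t, l, d)  # ((b t) l d): per-frame self-attention

# Fold time into the token axis so one attention call spans all frames.
x = rearrange(x, '(b t) l d -> b (t l) d', b=b, t=t)
print(x.shape)  # torch.Size([2, 512, 320])
```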
-
Thank you for your implementation of SAMI's training code. It has been incredibly helpful for me!
However, I have a question regarding the pretraining process. In the forward preprocess of the [MaeDeco…
-
Dear Authors,
Thank you very much for the open-source code.
I have a cross-attention module for which I would like to use flash attention, but unfortunately I get the following error:
`*** …
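Since the error is cut off above, it's hard to say what went wrong. For reference, though, `flash_attn_func` from the flash-attn package does support cross-attention with different query and key/value lengths, provided the tensors are fp16/bf16 CUDA tensors shaped (batch, seqlen, nheads, headdim). A minimal call that should work under those assumptions:

```python
import torch
from flash_attn import flash_attn_func

# flash-attn expects (batch, seqlen, nheads, headdim) in fp16/bf16 on CUDA.
q = torch.randn(2, 32, 8, 64, device='cuda', dtype=torch.float16)   # queries
k = torch.randn(2, 196, 8, 64, device='cuda', dtype=torch.float16)  # context keys
v = torch.randn(2, 196, 8, 64, device='cuda', dtype=torch.float16)  # context values

# Cross-attention: seqlen_q (32) and seqlen_k (196) may differ.
out = flash_attn_func(q, k, v, causal=False)
print(out.shape)  # torch.Size([2, 32, 8, 64])
```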
-
Boss, is there an example of CLIP+U-Net implementing cross-attention?
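Not an official example, but the usual pattern (as in latent-diffusion-style U-Nets) is to flatten the U-Net feature map into tokens, use those as queries, and use the CLIP text embeddings as keys/values. A hypothetical sketch, with all names illustrative:

```python
import torch
import torch.nn as nn

class UNetCrossAttention(nn.Module):
    """Illustrative sketch: U-Net feature-map tokens attend to CLIP text
    embeddings, latent-diffusion style (not from any specific repo)."""

    def __init__(self, channels: int, clip_dim: int, num_heads: int = 8):
        super().__init__()
        self.attn = nn.MultiheadAttention(channels, num_heads,
                                          kdim=clip_dim, vdim=clip_dim,
                                          batch_first=True)

    def forward(self, feat, text_emb):
        b, c, h, w = feat.shape
        tokens = feat.flatten(2).transpose(1, 2)        # (b, h*w, c)
        out, _ = self.attn(tokens, text_emb, text_emb)  # q=image, k=v=text
        return out.transpose(1, 2).view(b, c, h, w) + feat  # residual

feat = torch.randn(1, 320, 16, 16)  # U-Net feature map
text = torch.randn(1, 77, 768)      # CLIP text embeddings (77 tokens)
print(UNetCrossAttention(320, 768)(feat, text).shape)  # torch.Size([1, 320, 16, 16])
```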