-
Hi, I encountered an issue when loading a pre-trained model in the Part-Aware-Transformer repository. Specifically, when trying to resize the position embedding in the Vision Transformer (ViT), I get …
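For reference, here is a minimal sketch of the usual way a ViT position embedding is resized when the pretrained checkpoint and the target model use different patch grids: the extra (e.g. class) tokens are kept and the patch grid is 2-D interpolated. The function name, argument layout, and dimensions below are illustrative assumptions, not the Part-Aware-Transformer code.

```python
import torch
import torch.nn.functional as F

def resize_pos_embed(pos_embed, new_hw, num_extra_tokens=1):
    """Interpolate ViT position embeddings of shape (1, extra + H*W, C) to a new patch grid."""
    extra = pos_embed[:, :num_extra_tokens]   # class/distill tokens keep their embeddings
    grid = pos_embed[:, num_extra_tokens:]    # per-patch position embeddings
    dim = grid.shape[-1]
    old_hw = int(grid.shape[1] ** 0.5)        # assumes the pretrained grid was square
    grid = grid.reshape(1, old_hw, old_hw, dim).permute(0, 3, 1, 2)
    grid = F.interpolate(grid, size=new_hw, mode="bicubic", align_corners=False)
    grid = grid.permute(0, 2, 3, 1).reshape(1, new_hw[0] * new_hw[1], dim)
    return torch.cat([extra, grid], dim=1)

# e.g. adapt a 14x14 pretrained grid (197 tokens with [CLS]) to a 16x8 grid
pe = torch.randn(1, 1 + 14 * 14, 768)
print(resize_pos_embed(pe, (16, 8)).shape)  # torch.Size([1, 129, 768])
```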
-
This is excellent work; when will the code be open-sourced?
-
Dear RoyiRa,
I am trying to test whether LocalBlend works, so I ran the following code:
```python
import os
import torch
from prompt_to_prompt_pipeline import Prompt2PromptPipeline
from processors …
```
-
Thanks for this excellent work! But when I use diffusers to load the ControlNet:
```python
controlnet = SD3ControlNetModel.from_pretrained("stabilityai/stable-diffusion-3.5-controlnets-depth", torch_dtype=torch…
```
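For context, a minimal sketch of how an SD3 ControlNet is typically paired with the base pipeline in diffusers. The ControlNet repo id is taken verbatim from the report above; the base checkpoint name and dtype are assumptions, not necessarily what the original poster used.

```python
import torch
from diffusers import SD3ControlNetModel, StableDiffusion3ControlNetPipeline

# ControlNet id as quoted in the report; base checkpoint below is an assumption
controlnet = SD3ControlNetModel.from_pretrained(
    "stabilityai/stable-diffusion-3.5-controlnets-depth", torch_dtype=torch.float16
)
pipe = StableDiffusion3ControlNetPipeline.from_pretrained(
    "stabilityai/stable-diffusion-3.5-large",
    controlnet=controlnet,
    torch_dtype=torch.float16,
).to("cuda")
```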
-
**Describe the bug**
When using Flash Attention (`--use-flash-attention true`) to train a Qwen2VL model on mixed data (both image and text), the code yields the following error:
```
[rank0]: …
```
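Separately from the training framework's own flag, here is a minimal sketch of how Flash Attention 2 is typically requested for Qwen2VL through transformers; the checkpoint name and dtype are assumptions for illustration, not the configuration from the report.

```python
import torch
from transformers import Qwen2VLForConditionalGeneration, AutoProcessor

# Checkpoint name is an assumption; any Qwen2-VL checkpoint loads the same way
model = Qwen2VLForConditionalGeneration.from_pretrained(
    "Qwen/Qwen2-VL-7B-Instruct",
    torch_dtype=torch.bfloat16,
    attn_implementation="flash_attention_2",
    device_map="auto",
)
processor = AutoProcessor.from_pretrained("Qwen/Qwen2-VL-7B-Instruct")
```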
-
I don't know why, but whenever I set `use_dora = True` it always gives me this error when I train:
```
RuntimeError Traceback (most recent call last)
Cell In[26], line 1
----> 1 tr…
```
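For comparison, a minimal sketch of how DoRA is normally switched on through peft's `LoraConfig` (available in recent peft versions); the base model and target modules below are placeholders chosen only so the snippet runs, not the configuration from the traceback.

```python
from transformers import AutoModelForCausalLM
from peft import LoraConfig, get_peft_model

# Placeholder base model and target modules for illustration
base_model = AutoModelForCausalLM.from_pretrained("gpt2")
config = LoraConfig(
    r=16,
    lora_alpha=32,
    target_modules=["c_attn"],
    use_dora=True,  # DoRA decomposes each adapted weight into magnitude and direction on top of LoRA
)
model = get_peft_model(base_model, config)
model.print_trainable_parameters()
```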
-
## 🐛 Bug
## Please reproduce using our [template Colab](https://colab.research.google.com/drive/1R-dnKipK9LOVV4_oKbuHoq4VvGKGbDnd?usp=sharing) and post the link here
https://colab.research.g…
-
**Describe the bug**
Running the most recent version of the T5 pretraining script out of the box raises a ValueError, specifically at the following line:
```
[rank0]: File "/home/miniconda3/lib/…
```
-
### OS
Linux
### GPU Library
CUDA 12.x
### Python version
3.10
### Pytorch version
xxxxxxxxxxx
### Model
turboderp/Mistral-7B-instruct-exl2
### Describe the bug
## Warning: Flash Attention…
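For reference, a minimal sketch of loading an EXL2-quantized model with ExLlamaV2's dynamic generator, roughly following the project's examples; the local model path, prompt, and generation settings are assumptions, not the setup that produced the warning.

```python
from exllamav2 import ExLlamaV2, ExLlamaV2Config, ExLlamaV2Cache, ExLlamaV2Tokenizer
from exllamav2.generator import ExLlamaV2DynamicGenerator

# Local directory holding the downloaded turboderp/Mistral-7B-instruct-exl2 weights (assumed path)
model_dir = "/path/to/Mistral-7B-instruct-exl2"

config = ExLlamaV2Config(model_dir)
model = ExLlamaV2(config)
cache = ExLlamaV2Cache(model, lazy=True)
model.load_autosplit(cache)
tokenizer = ExLlamaV2Tokenizer(config)

generator = ExLlamaV2DynamicGenerator(model=model, cache=cache, tokenizer=tokenizer)
print(generator.generate(prompt="Hello, my name is", max_new_tokens=32))
```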
-
```
output_data, output_ui, has_subgraph = get_output_data(obj, input_data_all, execution_block_cb=execution_block_cb, pre_execute_cb=pre_execute_cb)
^^^^^^^^^^…
```