-
### What happened?
![WeChat screenshot 20240826183956](https://github.com/user-attachments/assets/05daea40-7b8f-4f69-81c0-10813fb8d3b5)
Error occurred when executing KSampler (Efficient):
Inference tens…
-
# 🌟 New model addition
My teammates and I (including @ice-americano) would like to use efficient self-attention methods such as Linformer, Performer, and Nystromformer.
## Model description
The…
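For context, the low-rank idea behind Linformer can be sketched in a few lines. This is a minimal NumPy illustration, not any library's actual implementation; the projection matrices `E` and `F` and all shapes are invented for the example:

```python
import numpy as np

def linformer_attention(q, k, v, E, F):
    """Linformer-style attention sketch: project keys/values along the
    sequence axis down to a fixed size r before the softmax, so the score
    matrix is (n, r) instead of (n, n). E and F are (r, n) projections
    (the paper learns/shares these; here they are random for illustration)."""
    d = q.shape[-1]
    k_proj = E @ k                            # (r, d)
    v_proj = F @ v                            # (r, d)
    scores = q @ k_proj.T / np.sqrt(d)        # (n, r) — linear in n
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)
    return weights @ v_proj                   # (n, d)

n, d, r = 128, 16, 8
rng = np.random.default_rng(0)
q, k, v = (rng.standard_normal((n, d)) for _ in range(3))
E = rng.standard_normal((r, n)) / np.sqrt(n)
F = rng.standard_normal((r, n)) / np.sqrt(n)
out = linformer_attention(q, k, v, E, F)
print(out.shape)  # (128, 16)
```

Performer and Nystromformer reach the same linear cost by different routes (random-feature kernels and landmark points, respectively), but the shared goal is avoiding the (n, n) attention matrix.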
-
Hi! I tried to fine-tune llama-2-13b with a bottleneck adapter, but it raised a ValueError saying the model cannot be fine-tuned when loaded with load_in8bit. What is the problem? How can I solve it?
**ValueE…
-
Hi there -- I took a quick look at your code. A key motivation for modeling sequences via linear recurrence relations (instead of, say, self-attention) is that they can be implemented to execute with …
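The truncated point is presumably about efficiency, so as an illustration: a first-order linear recurrence h_t = a_t * h_{t-1} + b_t runs in O(n) time with constant state per step, and because the update is associative it also admits an O(log n)-depth parallel scan. This sketch uses made-up names and shapes, not anything from the repo under discussion:

```python
import numpy as np

def linear_recurrence(a, b):
    """Sequential O(n) evaluation of h_t = a_t * h_{t-1} + b_t with h_0 = 0.
    Unlike self-attention's O(n^2) pairwise interactions, the running state
    h summarizes the entire history in constant memory per step."""
    h = np.zeros_like(b[0])
    out = []
    for a_t, b_t in zip(a, b):
        h = a_t * h + b_t
        out.append(h)
    return np.stack(out)

def scan_combine(x, y):
    """Associative combine for the same recurrence, enabling a parallel scan:
    composing h -> a1*h + b1 then h -> a2*h + b2 gives (a2*a1, a2*b1 + b2)."""
    a1, b1 = x
    a2, b2 = y
    return a2 * a1, a2 * b1 + b2

a = np.full((8, 4), 0.5)   # decay coefficients (illustrative values)
b = np.ones((8, 4))        # inputs
h = linear_recurrence(a, b)
print(h.shape)  # (8, 4)
```

Folding `scan_combine` left-to-right over the (a_t, b_t) pairs reproduces the final state of the sequential loop, which is what makes the log-depth parallel implementation possible.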
-
### Describe the issue
The com.microsoft::BeamSearch op outputs wrong values when the following conditions are satisfied:
- Running on CUDA execution provider
- Using _model_type_ = 1 (T5-like mode…
-
### Expected Behavior
I could load the flux model yesterday, but I don't know why this error occurs today.
### Actual Behavior
Please check the code.
### Steps to Reproduce
it's not about the work…
-
The provided code calculates the matrix product of q and k.
https://github.com/YuchuanTian/DiJiang/blob/main/modeling/pythia-2.8B-dijiang/modeling_gpt_neox_dijiang.py#L286
That means it has computational …
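The complexity point can be demonstrated directly: when there is no softmax between the two products (as in kernelized/linear attention variants), matrix multiplication is associative, so reordering avoids materializing the (n, n) intermediate. A NumPy sketch with made-up shapes, independent of the linked file:

```python
import numpy as np

rng = np.random.default_rng(0)
n, d = 256, 16
Q, K, V = (rng.standard_normal((n, d)) for _ in range(3))

# Two mathematically identical orderings of (Q K^T) V:
quadratic = (Q @ K.T) @ V   # materializes an (n, n) matrix: O(n^2 d)
linear    = Q @ (K.T @ V)   # only a (d, d) intermediate:    O(n d^2)

print(np.allclose(quadratic, linear))  # True
```

Computing q @ k first, as in the linked line, therefore forces the quadratic cost even when the surrounding method is advertised as linear.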
-
I fine-tuned llama3.1 8b bnb 4-bit according to your recommendations with my own train+eval dataset and saved it as a merged 16-bit model. I now want to run inference by loading the 16-bit merged model and usin…
-
ENVIRONMENT
Windows 10
GPU 1660 Super
32 GB RAM
So I tried a LoRA model that I made, and when I try to get results from prompts I get a warning that the LoRA keys were not loaded, and the image is not the de…
-
Hi Phil, thanks for the great repo.
I compared your implementation of ViT with huggingface's (https://github.com/huggingface/transformers/blob/master/src/transformers/models/vit/modeling_vit.py) and…