linear-transformer Search Results

NVIDIA/TransformerEngine #654

PIP Installation Failed

Hello I want to install TE using pip: `pip install git+https://github.com/NVIDIA/TransformerEngine.git@stable` But I got the following error during installation: ``` Collecting git+https://gi…

mahdip72 updated 2 months ago

yuqinie98/PatchTST #69

关于model参数的问题

请问为什么model参数里面只有informer、autoformer等几个模型啊，没有见到pathctst

1348598339 updated 4 months ago

ahaliassos/raven #5

The configuration file for the large model has incorrect par…

Some parameters in the configuration file are inconsistent with the provided model parameters. For example, in conf/model/visual_backbone/resnet_transformer_large.yaml (audio backbone may also have a …

jinchiniao updated 1 year ago

MouYongli/LLMs4OL #1

LoRA Paper(https://arxiv.org/pdf/2106.09685)

我们可以将LoRa应用于nn中任意的权重矩阵之间 1. MLP中，有两个权重矩阵： **输入层到隐藏层的权重矩阵**：这个权重矩阵用来连接输入层和隐藏层，它的大小是由输入特征的维度和隐藏层神经元的数量决定的。每一行对应一个隐藏层神经元，每一列对应输入层的一个特征。这个权重矩阵用来将输入特征线性组合成隐藏层的输出。 **隐藏层到输出层的权重矩阵**：这个权重矩阵用来连接隐藏层和输出…

Kleinpenny updated 2 months ago

TencentQQGYLab/ComfyUI-ELLA #42

Error occurred when executing T5TextEncode #ELLA: (RX580 i39…

Error occurred when executing T5TextEncode #ELLA: "addmm_impl_cpu_" not implemented for 'Half' File "C:\Users\WarMa\OneDrive\Escritorio\ComfyUI\ComfyUI\execution.py", line 151, in recursive_exec…

KillyTheNetTerminal updated 2 months ago

keras-team/keras-nlp #98

Add the gMLP Encoder Block

The gMLP model is from the paper "[Pay Attention to MLPs](https://arxiv.org/abs/2105.08050)". It has a decent number of citations - around 40. Every Encoder Block merely consists of linear layers, a "…

abheesht17 updated 2 months ago

narsisn/Argoverse2_Motion_Forecasting #2

Documentation of this repo

Hi, Thank you for your wonderful work. Could you provide more details about the structure of the pipeline? What are the differences between TGR and MTMF models? Comparing TGR and Crat-PRED, you …

Cram3r95 updated 1 year ago

NetEase-FuXi/EETQ #21

Does it support Vision Transformers?

Hi, I would like to know if ViT supports Eetq and LoRA and if I can have an example of this: `from transformers import ViTForImageClassification, ViTImageProcessor from peft import get_peft_model, L…

PaulaDelgado-Santos updated 1 month ago

pytorch/pytorch #120189

Making Mamba first-class citizen in PyTorch

### 🚀 The feature, motivation and pitch [Mamba](https://arxiv.org/pdf/2312.00752.pdf) is a new SSM (State Space Model) which is developed to address Transformers’ computational inefficiency on long…

yanboliang updated 1 month ago

lucidrains/taylor-series-linear-attention #2

Replicating Results?

Thank you for the code! I've been using it as a reference for my own implementation. Have you replicated the results in the original blogpost..? Based on your update in the readme, it seems like you h…

fattorib updated 5 months ago

1000+ results for linear-transformer

1000+ results
for linear-transformer