gated-attention Search Results

768 results
for gated-attention

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

HazyResearch/m2 #3

Code for projecting pre-trained BERT weights into Monarch ma…

Hello, I would like to know if you have published the code to project the pre-trained weights of the BERT model into Monarch matrices. I cannot locate the code for this (I have also looked in the fly …

sinamps updated 1 year ago
2
huggingface/transformers #22372

Add Restormer

### Model description **Restormer: Efficient Transformer for High-Resolution Image Restoration** was published in CVPR 2022, which introduced a new Vision Transformer based architecture for Image Res…

tushdon2 updated 1 year ago
1
GTNewHorizons/GT-New-Horizons-Modpack #16137

Quest suggestion: Ability to skip over wooden track quest by…

### Your GTNH Discord Username @droideka30 ### Your Pack Version 2.5.1 ### Your Proposal If you decide to get into railroading after making an assembler, it's really annoying that the q…

dgealow updated 2 months ago
2
YuchuanTian/DiJiang #6

Provided code seems to have O(n x n x d) computational compl…

Provided code calculates matrix product of q and k. https://github.com/YuchuanTian/DiJiang/blob/main/modeling/pythia-2.8B-dijiang/modeling_gpt_neox_dijiang.py#L286 That means it has computational …

bilzard updated 2 months ago
6
NVIDIA/FasterTransformer #482

FasterTransformer cannot run google/mt5-base

### Branch/Tag/Commit main ### Docker Image Version pytorch-22.08.py3 ### GPU name V100 ### CUDA Driver 515.65 ### Reproduced Steps ```shell MT5 need gelu_new op，but FasterTransformer doesn't…

mpjlu updated 1 year ago
2
HazyResearch/m2 #21

precision on imagenet experiment

Hi, For imagenet, you mentioned in the paper the Hyena code is used for the experimentation by replacing MLP blocks in Hyena ViT-b with block-diagonal matrices, similarly to M2-BERT. Based on the …

Karami-m updated 8 months ago
1
JiahuiYu/generative_inpainting #75

Questions about results from with my own dataset

Hi, Jiahui! After an one-week training on a GTX 1080TI, I found some interesting results from my own dataset. There are 2 kinds of images in my dataset. One is the images with clearly texture like th…

xhh232018 updated 5 years ago
8
rosinality/vq-vae-2-pytorch #31

Error in code for PixelSnail (Proper masking)

first of all, thx for implementation! the question is about proper masking inside the model 1. shift_down and shift right in the beginning of PixelSnail module have already taken care of masking…

rakhimovv updated 4 years ago
2
Fannovel16/comfyui_controlnet_aux #349

Error occurred when executing MediaPipe-FaceMeshPreprocessor…

Error occurred when executing MediaPipe-FaceMeshPreprocessor: Failed to parse: node { calculator: "ImagePropertiesCalculator" input_stream: "IMAGE:image" output_stream: "SIZE:image_size" } nod…

Affegithub updated 1 week ago
9
BUPT/clubber.ml #30

2018-10-28 CAIC(Conversational AI Club)第十二次CAIC沙龙活动通知

wyy0206 updated 5 years ago
1

上一页 1...1 2 3 4 5 6 7...77 下一页

768 results for gated-attention

768 results
for gated-attention