self-attention-network Search Results

arXivTimes/arXivTimes #760

Self-Attention Generative Adversarial Networks

## 一言でいうと既存のGANはCNNベースのため局所特徴に依存しており、離れた場所の情報を参照することができない。そのため、Attentionの仕組みを導入して離れた局所特徴を重みをかけて参照できるようにする手法。局所特徴とAttention情報の利用の度合いは、係数でもって調整を行う。 ![image](https://user-images.githubusercontent.…

icoxfog417 updated 6 years ago

shimopino/papers-challenge #144

Self-Attention Generative Adversarial Networks

### 論文へのリンク [[arXiv:1805.08318] Self-Attention Generative Adversarial Networks](https://arxiv.org/abs/1805.08318) ### 著者・所属機関 Han Zhang, Ian Goodfellow, Dimitris Metaxas, Augustus Odena - …

shimopino updated 4 years ago

shimopino/papers-challenge #225

Global Self-Attention Networks for Image Recognition

### 論文へのリンク [[arXiv:2010.03019] Global Self-Attention Networks for Image Recognition](https://arxiv.org/abs/2010.03019) ### 著者・所属機関 Zhuoran Shen, Irwan Bello, Raviteja Vemulapalli, Xuhui Jia,…

shimopino updated 3 years ago

snap-research/weights2weights #3

error loading lora in diffusers

I try to load lora with pipe.load_lora_weights("/content/adapter_model.safetensors") but it gives an error diffusers==0.28.0 model ='stablediffusionapi/realistic-vision-v51' --------------…

loboere updated 1 month ago

NVIDIA/TensorRT #4005

Inconsistencies in output when running inference on DeBERTa …

## Description When running inference with TensorRT's disentangled attention plugin on Microsoft's implementation of DeBERTa , I noticed that I get inconsistent output when running with dynamic seq…

jkim695 updated 1 month ago

heykeetae/Self-Attention-GAN #57

the meaning of Gamma in Attention model

Thanks for your amazing work first. I am quite confused about the "gamma" in Attention Model, can you explain the meaning of "Gamma"? I also can not find the parameter in the original paper. Th…

hank19960918 updated 6 days ago

e4exp/paper_manager_abstract #543

Self-Attention Networks Can Process Bounded Hierarchical Lan…

- https://arxiv.org/abs/2105.11115 - 2021 ACL 自己注目型ネットワークは、NLPにおいて素晴らしい性能を発揮しているにもかかわらず、最近、階層構造を持つ形式言語の処理には限界があることが証明された。例えば、𝖣𝗒𝖼𝗄k（k種類の括弧がよく入れ子になった言語）などである。これは、形式言語では弱すぎるモデルでも、自然言語はうまく近似できること、…

e4exp updated 3 years ago

tatsuropfgt/papers #13

On the Global Self-attention Mechanism for Graph Convolution…

# On the Global Self-attention Mechanism for Graph Convolutional Networks [[Wang+, 20](https://arxiv.org/abs/2010.10711)] ## Abstract - Apply Global self-attention (GSA) to GCNs - GSA allows GCNs…

tatsuropfgt updated 8 months ago

NVIDIA/TensorRT-LLM #1930

failed to load whisper decoder engine with paged kv cache

### System Info - CPU architecture: x86_64 - CPU/Host memory size: 32GB DDR4 - GPU properties - GPU name: RTX 3070 Ti - GPU memory size: 8GB - Libraries - TensorRT-LLM version: 0.12.0.d…

MahmoudAshraf97 updated 4 days ago

e4exp/paper_manager_abstract #572

How Can Self-Attention Networks Recognize Dyck-n Languages?

- https://arxiv.org/abs/2010.04303 - 2020 EMNLP 本研究では、Dyck-n (n) 個の言語の認識を、自己注意（SA）ネットワークで行うことに注目する。本研究では、開始記号を持つSA（SA+）と持たないSA（SA-）という2種類のSAの性能を比較した。その結果、SA+は、より長い配列やより深い依存関係に一般化できることがわかった。ま…

e4exp updated 3 years ago

1000+ results for self-attention-network

1000+ results
for self-attention-network