-
Hi,
thank you for releasing the code for nonautoregressive_transformer.
But why can't I find the positional attention in the decoder, as described in the paper [Non-Autoregressive Neural Machine Tr…
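For context, my understanding of the positional attention module from that paper is roughly the sketch below: the positional encodings act as both query and key, and the decoder states as the value. This is only an illustration with made-up names (`pos_attn`, `d_model`, `n_heads`), not code from this repo:

```python
import torch.nn as nn

d_model, n_heads = 512, 8
# One multi-head attention block per decoder layer, used only for positional attention
pos_attn = nn.MultiheadAttention(d_model, n_heads, batch_first=True)

def positional_attention(decoder_states, pos_encoding):
    # decoder_states: (batch, tgt_len, d_model) -- hidden states of the decoder layer
    # pos_encoding:   (batch, tgt_len, d_model) -- sinusoidal positional embeddings
    # Query and key are the positional encodings; value is the decoder states.
    out, _ = pos_attn(query=pos_encoding, key=pos_encoding, value=decoder_states)
    return out
```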
-
DeepSpeed implements a transformer kernel that invokes the CUDA kernel only once for the Q, K, and V values, as opposed to three times (one invocation each for Q, K, and V), resulting in a 3% to …
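The high-level idea, shown here as a minimal PyTorch sketch of fused QKV rather than DeepSpeed's actual CUDA kernel, is to compute Q, K, and V from a single combined weight matrix so the projection happens in one launch instead of three:

```python
import torch
import torch.nn as nn

class FusedQKV(nn.Module):
    """Toy illustration of the fused-QKV idea: one weight matrix, one matmul."""
    def __init__(self, d_model):
        super().__init__()
        self.qkv = nn.Linear(d_model, 3 * d_model)   # single fused projection

    def forward(self, x):                            # x: (batch, seq, d_model)
        q, k, v = self.qkv(x).chunk(3, dim=-1)       # split the fused output
        return q, k, v
```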
-
Sorry for the noob question: what is the best (in terms of quality) pretrained English TTS available today? Is it the following combination, or is there something better?
1. Tacotron2 | char_train_n…
-
Hi Giannis!
Thanks for the great paper! I am interested in your asymmetric LSH, as I think having separate query/key spaces (as opposed to the shared QK space in Reformer) will bring performance improvem…
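To make sure I understand the asymmetric part, here is a rough sketch of the general asymmetric-LSH idea (the classic MIPS-to-NNS augmentation in the style of Shrivastava & Li), not necessarily the exact transform from your paper: queries and keys receive different augmentations so that a standard symmetric LSH on the augmented vectors still respects the inner product:

```python
import torch

def augment_keys(k, scale):
    # k: (n, d); pad each key so all augmented keys share the norm `scale`
    extra = (scale ** 2 - (k ** 2).sum(-1, keepdim=True)).clamp(min=0).sqrt()
    return torch.cat([k, extra], dim=-1)

def augment_queries(q):
    # queries get a zero in the extra coordinate, so q_aug . k_aug == q . k
    return torch.cat([q, torch.zeros_like(q[..., :1])], dim=-1)

q, k = torch.randn(128, 64), torch.randn(128, 64)
scale = k.norm(dim=-1).max()
q_aug, k_aug = augment_queries(q), augment_keys(k, scale)
# any standard (symmetric) LSH scheme can now be applied to q_aug / k_aug
```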
-
## Environment info
- `transformers` version: 4.3.3
- The other parameters are irrelevant
### Who can help
@patrickvonplaten @sgugger
## Information
I apologize for not using the prov…
-
**System information**
- OS Platform and Distribution (e.g., Linux Ubuntu 16.04): Windows 7
- TensorFlow installed from (source or binary): binary
- TensorFlow version (or github SHA if from source): 2…
-
I want to implement some changes to the self-attention used in the Transformer for MT, namely locality-sensitive hashing (https://arxiv.org/pdf/2001.04451.pdf).
Right now, self-attention …
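One way to prototype this, as a rough sketch of Reformer-style angular LSH applied as a mask over ordinary attention (my own naming, and without the chunking/sorting that makes it sub-quadratic), could look like:

```python
import torch
import torch.nn.functional as F

def lsh_hash(x, n_buckets, seed=0):
    # Angular LSH as in the Reformer paper: random rotations, then argmax over
    # the concatenation [xR, -xR] assigns each position to one of n_buckets.
    g = torch.Generator().manual_seed(seed)
    r = torch.randn(x.shape[-1], n_buckets // 2, generator=g).to(x.device)
    xr = x @ r
    return torch.cat([xr, -xr], dim=-1).argmax(dim=-1)       # (batch, seq)

def bucketed_attention(q, k, v, n_buckets=16):
    # Plain O(n^2) attention, masked so each query only attends to keys in the
    # same LSH bucket (the diagonal stays unmasked so every row has a target).
    scores = q @ k.transpose(-2, -1) / q.shape[-1] ** 0.5
    qb, kb = lsh_hash(q, n_buckets), lsh_hash(k, n_buckets)
    mask = qb.unsqueeze(-1) != kb.unsqueeze(-2)               # (batch, seq, seq)
    eye = torch.eye(q.shape[-2], dtype=torch.bool, device=q.device)
    scores = scores.masked_fill(mask & ~eye, float('-inf'))
    return F.softmax(scores, dim=-1) @ v
```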
-
Hi, thanks for making such a repo.
I have one question here:
Why do you mark "HOT-Net: Non-Autoregressive Transformer for 3D Hand-Object Pose Estimation" as an MM20 paper? I could not find the citatio…
-
As the 2010s draw to a close, it's worth taking a look back at the monumental progress that has been made in Deep Learning in this decade.
Tags: deep_learning
via Pocket https://ift.tt/…
-
Hi, Jungo. Thanks for your nice code!
I want to use your DisCo model to train an autoregressive model as described in your paper (Sec. 5.1: AT with Contextless KVs). I saw there is an argument called at-…