-
## Environment info
- `transformers` version: 3.5.0
- Platform: Linux
- Python version: 3.7
- PyTorch version (GPU?):
- Tensorflow version (GPU?): 2.3.1
- Using GPU in script?: Yes
- U…
-
## Environment info
- `transformers` version: 4.1.1 (stable)
- Platform: Google Colab
- Python version: 3.6.9
- PyTorch version (GPU?): 1.7.0+cu101
- Using GPU in script?: yes
- Using distri…
-
TensorFlow 1.15 has a function "tf.contrib.seq2seq.AttentionWrapper", but as we know the tf.contrib module is no longer available in TensorFlow 2.0. I find that "tfa.seq2seq.AttentionWrapper" may replace "tf.c…
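For anyone hitting the same migration issue, a minimal sketch of the TensorFlow Addons replacement under TF 2.x (layer sizes and inputs below are arbitrary, not taken from any particular model):

```python
# Minimal sketch: tfa.seq2seq.AttentionWrapper as the TF 2.x stand-in for
# the removed tf.contrib.seq2seq.AttentionWrapper.
import tensorflow as tf
import tensorflow_addons as tfa

batch_size, max_time, units = 4, 10, 128
memory = tf.random.normal([batch_size, max_time, units])  # e.g. encoder outputs

attention_mechanism = tfa.seq2seq.BahdanauAttention(units=units, memory=memory)
decoder_cell = tfa.seq2seq.AttentionWrapper(
    tf.keras.layers.LSTMCell(units),
    attention_mechanism,
    attention_layer_size=units,
)

# One decoder step with a dummy input, just to show the cell is usable.
state = decoder_cell.get_initial_state(batch_size=batch_size, dtype=tf.float32)
output, state = decoder_cell(tf.random.normal([batch_size, units]), state)
```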
-
Hello,
I'm trying to load GMM attention. I modified tacotron_gmm.py and changed the header to the following, but it fails to run. Could you please advise on a solution? Thank you!
import tensorflow as tf
from tacotron.utils.symbols import symbols
from tacotron.utils.infolog import log
from tacotron.…
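Not an official fix, but if the failure comes from imports further down in the (truncated) header still pointing at `tf.contrib.seq2seq`, a hypothetical mapping to the TensorFlow Addons equivalents under TF 2.x could look like the sketch below; the project-local `tacotron.*` imports would stay as in the original header.

```python
# Hypothetical replacement for old tf.contrib.seq2seq imports, assuming the
# truncated header lines referenced them (tensorflow_addons must be installed).
import tensorflow as tf
import tensorflow_addons as tfa

# old: from tensorflow.contrib.seq2seq import AttentionWrapper, BahdanauAttention
AttentionWrapper = tfa.seq2seq.AttentionWrapper
BahdanauAttention = tfa.seq2seq.BahdanauAttention
```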
-
- `transformers` version: 4.1.1
- mBART: @patrickvonplaten
## Information
Model I am using (Bert, XLNet ...): mBART
The problem arises when using:
* [x] the official example scripts: (give deta…
-
I need to train a Chinese address error-correction model and have a few questions:
1. For Chinese address correction, jieba word segmentation may have a fairly high error rate. Can I use character-level segmentation directly instead?
2. If character-level segmentation is used, what should the dataset format be? (Following the README: segment first, separate tokens with spaces, and separate the erroneous and the corrected text with a tab?) How should it be prepared, and what annotation tool is usually used? (See the sketch after this list.)
3. The readme mentions "conv_seq2seq、seq2s…
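A minimal sketch of what character-level preparation could look like, assuming the README's format of space-separated tokens with a tab between the erroneous and the corrected text; the file name and the example pair below are made up:

```python
# Minimal sketch: write character-level (noisy, clean) address pairs in the
# "space-separated tokens, tab between source and target" format.

def to_char_tokens(text: str) -> str:
    """Split a Chinese string into space-separated characters."""
    return " ".join(ch for ch in text.strip() if not ch.isspace())

pairs = [
    # (erroneous address, corrected address) – toy example only
    ("北京市潮阳区建国路89号", "北京市朝阳区建国路89号"),
]

with open("train.tsv", "w", encoding="utf-8") as f:
    for noisy, clean in pairs:
        f.write(f"{to_char_tokens(noisy)}\t{to_char_tokens(clean)}\n")
```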
-
# 🚀 Feature request
Please could we have the ability to return attention weights from the decoded generated tokens to the encoded source?
## Motivation
To attribute the decoded text, e.g. in t…
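A minimal sketch of what the requested behaviour could look like, assuming a `transformers` version where `generate` exposes cross-attention weights via `return_dict_in_generate` and `output_attentions` (the checkpoint below is only an example):

```python
# Minimal sketch: retrieve decoder-to-encoder (cross) attention weights for
# each generated token from generate(), assuming a version that supports it.
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

tokenizer = AutoTokenizer.from_pretrained("facebook/bart-base")
model = AutoModelForSeq2SeqLM.from_pretrained("facebook/bart-base")

inputs = tokenizer("The tower is 324 metres tall.", return_tensors="pt")
out = model.generate(
    **inputs,
    return_dict_in_generate=True,
    output_attentions=True,
    max_length=20,
)

print(out.sequences)
# out.cross_attentions: one entry per generation step; each entry is a
# per-layer tuple of attention tensors from the new decoder token to the
# encoded source tokens.
print(len(out.cross_attentions))
```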
-
### Question
I'm trying to fine-tune a seq2seq model using the fork command, and I got this error message:
target contains elements out of valid range [0, num_categories) in categorical cross entropy
####…
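Not specific to any one framework, but a minimal diagnostic sketch for this kind of error (the vocabulary size and target array below are hypothetical): the message usually means some target token id falls outside the range covered by the output layer, e.g. because the tokenizer vocabulary is larger than the final softmax.

```python
# Minimal sketch: check that every target id lies in [0, num_categories).
import numpy as np

num_categories = 32000  # size of the decoder's output (softmax) layer – adjust
targets = np.array([[2, 15, 31999], [2, 40000, 1]])  # toy target id matrix

bad = (targets < 0) | (targets >= num_categories)
if bad.any():
    print("out-of-range target ids:", np.unique(targets[bad]))
    print("likely cause: tokenizer vocabulary larger than the output layer")
else:
    print("all target ids are inside [0, num_categories)")
```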
-
So, I tried to follow the procedure you described here for a Bengali dataset:
https://github.com/pkouris/abtextsum/issues/3
I have tried to follow this on 3 different machines (because I thought i…
-
I'm training Longformer2Roberta; the encoder part of this Seq2Seq model is Longformer. The one feature Longformer brings is global attention. I found the use of it during training, but it is never use…
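A minimal sketch of how the global attention mask is normally supplied to a Longformer encoder, assuming a Longformer2Roberta `EncoderDecoderModel`; whether `generate` forwards `global_attention_mask` to the encoder may depend on the `transformers` version, so it is passed explicitly on a forward call here:

```python
# Minimal sketch: give the first token global attention and pass the mask
# through an EncoderDecoderModel forward call (checkpoints are examples).
import torch
from transformers import AutoTokenizer, EncoderDecoderModel

tokenizer = AutoTokenizer.from_pretrained("allenai/longformer-base-4096")
model = EncoderDecoderModel.from_encoder_decoder_pretrained(
    "allenai/longformer-base-4096", "roberta-base"
)

inputs = tokenizer("A long input document ...", return_tensors="pt")
global_attention_mask = torch.zeros_like(inputs["input_ids"])
global_attention_mask[:, 0] = 1  # global attention on the <s> token

outputs = model(
    input_ids=inputs["input_ids"],
    attention_mask=inputs["attention_mask"],
    global_attention_mask=global_attention_mask,  # routed to the encoder
    decoder_input_ids=inputs["input_ids"][:, :1],
)
print(outputs.logits.shape)
```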