-
### Description
I have trained a simple NMT DNN using the Transformer model on a small dataset, and I am pretty impressed by the good results achieved with just 4,500 steps. Now the problem arises when …
-
I would like to use the Transformer architecture for a sequence-labeling problem. I have two files: one containing the input tokens, and the other the labels. The labels are short strings an…
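In case a concrete starting point helps: below is a minimal, untested sketch of how such a problem could be registered, assuming tensor2tensor's `text_problems.Text2TextProblem` API and two hypothetical line-aligned files `tokens.txt` and `labels.txt` (names chosen for illustration only).

```python
# Sketch only: hypothetical sequence-labeling problem for tensor2tensor.
from tensor2tensor.data_generators import text_problems
from tensor2tensor.utils import registry


@registry.register_problem
class SequenceLabeling(text_problems.Text2TextProblem):
  """Reads line-aligned files: one line of tokens, one line of labels."""

  @property
  def is_generate_per_split(self):
    # Let t2t split the generated examples into train/dev itself.
    return False

  @property
  def approx_vocab_size(self):
    return 2**13  # placeholder; depends on the token/label inventory

  def generate_samples(self, data_dir, tmp_dir, dataset_split):
    # tokens.txt and labels.txt are hypothetical, line-aligned inputs.
    with open("tokens.txt") as src, open("labels.txt") as tgt:
      for tokens, labels in zip(src, tgt):
        yield {"inputs": tokens.strip(), "targets": labels.strip()}
```

If this matches your setup, the registered problem would then be referred to as `sequence_labeling` when running `t2t-datagen` and `t2t-trainer`.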
-
Hi,
I would like to create a new model that uses the Transformer encoder and decoder. However, it also needs to include layers in between, and the decoder needs to use not only the output of the tr…
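Since the description is cut off here, the following is only a rough sketch of the kind of composition being described, written with PyTorch's stock `nn.TransformerEncoder`/`nn.TransformerDecoder` for concreteness; the `bridge` module and the `extra_memory` input are hypothetical placeholders, not anything from an existing model.

```python
# Sketch: encoder -> extra "bridge" layers -> decoder, where the decoder
# also attends to an additional memory besides the encoder output.
import torch
import torch.nn as nn


class BridgedTransformer(nn.Module):
    def __init__(self, d_model=512, nhead=8, num_layers=6):
        super().__init__()
        enc_layer = nn.TransformerEncoderLayer(d_model, nhead, batch_first=True)
        dec_layer = nn.TransformerDecoderLayer(d_model, nhead, batch_first=True)
        self.encoder = nn.TransformerEncoder(enc_layer, num_layers)
        # "Layers in between": a simple feed-forward bridge as a placeholder.
        self.bridge = nn.Sequential(
            nn.Linear(d_model, d_model), nn.ReLU(), nn.Linear(d_model, d_model))
        self.decoder = nn.TransformerDecoder(dec_layer, num_layers)

    def forward(self, src, tgt, extra_memory=None):
        memory = self.bridge(self.encoder(src))
        if extra_memory is not None:
            # The decoder sees the bridged encoder output *and* the extra
            # source, concatenated along the sequence dimension.
            memory = torch.cat([memory, extra_memory], dim=1)
        return self.decoder(tgt, memory)


# Toy usage with already-embedded inputs of shape (batch, seq, d_model).
model = BridgedTransformer()
out = model(torch.randn(2, 10, 512), torch.randn(2, 7, 512),
            extra_memory=torch.randn(2, 5, 512))
```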
-
**Describe the bug**
My model makes use of `torch.nn.fold` quite a bit. When I try to use `add_graph` on my model, I get the following exception:
```
in merge(self, x)
39
40 …
-
### Description
I am training a `Transformer` model on the `Librispeech` dataset using 4 GPUs with 8 CPU cores.
I have tested the following:
#### Single-GPU
```bash
export CUDA_VISIBLE_D…
-
### Description
...
### Environment information
```
OS:
Version: tf-cpu.1-14.m34
Based on: Debian GNU/Linux 9.9 (stretch) (GNU/Linux 4.9.0-9-amd64 x86_64)
Linux cpu1-vm 4.9.0-9-amd64 #…
-
### Description
I tried training an LM with both languagemodel_ptb10k and languagemodel_lm1b32k as target problems; both succeeded without problems. The decoding part also seemed fine, but the output re…
-
### Description
After the language model has been trained, when I decode from a file I always get this output:
"pad>......"
### Environment information
```
OS: centos 7.3
$ pip freeze | grep …
-
### Description
I'm trying to reproduce the En-De experiment in the paper "Attention Is All You Need".
However, I'm confused by the training data. The paper used the WMT14 training data, while the follow…
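In case it helps while this gets answered: one way to see exactly which corpora the bundled En-De problem downloads is to list its source data files. A small, untested sketch, assuming the problem class is `translate_ende.TranslateEndeWmt32k`:

```python
# Sketch: print the raw corpora behind the bundled En-De problem.
# Assumes tensor2tensor's translate_ende module and that each entry is a
# (download URL, filenames) pair; untested.
from tensor2tensor.data_generators import problem, translate_ende

ende = translate_ende.TranslateEndeWmt32k()
for url, files in ende.source_data_files(problem.DatasetSplit.TRAIN):
    print(url, files)
```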
-
Hi.
I am trying to use a TPUv2-8 to train a query classifier. However, I ran into some memory issues.
Officially, a TPUv2-8 is claimed to have 64 GB of memory. However, I keep getting this error w…