-
Hello, I want to do distributed training on an En-Zh translation task using four machines, each with eight 1080 Ti GPUs; the t2t version is 1.6.5. I have seen the other similar issues, and
the di…
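For context, a minimal sketch of the multi-machine setup I am assuming, based on the `t2t-make-tf-configs` workflow from the distributed-training docs; the hostnames, ports, and the En-Zh problem name below are placeholders, and flag names may vary between t2t versions:
```bash
# Generate a TF_CONFIG string plus trainer flags for each machine
# (hostnames/ports are placeholders).
t2t-make-tf-configs \
  --masters='host1:2222,host2:2222,host3:2222,host4:2222' \
  --ps=''

# Then, on each machine, export the TF_CONFIG printed for it and run
# the trainer on all 8 local GPUs:
TF_CONFIG='<printed-config-for-this-machine>' t2t-trainer \
  --problem=translate_enzh_wmt32k \
  --model=transformer \
  --hparams_set=transformer_base \
  --worker_gpu=8 \
  --output_dir=$TRAIN_DIR
```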
-
I ran the setup instructions on a pre-existing GCP machine with CUDA 10.1 and one modification:
```bash
mv ckpt/pegasus_ckpt ckpt2
```
(Instructions don't work as written because they don't acknowl…
-
### Description
I am trying to follow the approach from the paper *Blockwise Parallel Decoding for Deep Autoregressive Models*. It states that the model is first trained as a standard Transformer for …
-
Sentences longer than the `max_length` parameter are excluded from training, so lowering this parameter helps to prevent [OOM errors](#581) and makes it possible to use a [higher `batch_size`](https://github.com/t…
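A minimal sketch of overriding both hparams on the command line (the paths, problem, and concrete values are illustrative; flag names may differ slightly between t2t versions):
```bash
# Drop all sentences longer than 70 subword tokens and raise the
# token-based batch size; both values are illustrative.
t2t-trainer \
  --data_dir=$DATA_DIR \
  --output_dir=$TRAIN_DIR \
  --problem=translate_ende_wmt32k \
  --model=transformer \
  --hparams_set=transformer_base \
  --hparams='max_length=70,batch_size=4096'
```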
-
### Description
Why doesn't t2t training use all of the GPU memory?
nvidia-smi reports low GPU
![image](https://user-images.githubusercontent.com/32744746/64552801-fc0db500-d372-11e9-8864-5104b5c16b59.p…
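For anyone reproducing this, a quick way to watch both utilization and memory over time, using standard `nvidia-smi` query flags:
```bash
# Sample GPU utilization and memory once per second; helps tell low
# *utilization* apart from low *memory allocation*.
nvidia-smi \
  --query-gpu=index,utilization.gpu,memory.used,memory.total \
  --format=csv -l 1
```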
-
I'd like to train a transformer_ae model on the translate_ende_wmt32k problem.
Part of my command is copied below.
```
PROBLEM=translate_ende_wmt32k
MODEL=transformer_ae
HPARAMS=transformer…
```
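For reference, a minimal sketch of the full pipeline I am assuming (directories are placeholders, and I am guessing `transformer_ae_small` as the hparams set; the actual registered name may differ):
```bash
DATA_DIR=$HOME/t2t_data
TMP_DIR=/tmp/t2t_datagen
TRAIN_DIR=$HOME/t2t_train/ae

# Generate the WMT En-De data, then train the transformer_ae model.
t2t-datagen --data_dir=$DATA_DIR --tmp_dir=$TMP_DIR \
  --problem=translate_ende_wmt32k
t2t-trainer --data_dir=$DATA_DIR --output_dir=$TRAIN_DIR \
  --problem=translate_ende_wmt32k \
  --model=transformer_ae \
  --hparams_set=transformer_ae_small
```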
-
There have been a lot of advancements recently in achieving context for dialog models through a separate context layer, e.g. [HRAN](https://arxiv.org/pdf/1701.07149.pdf) or [VHRED](http://www.cs.toronto…
-
I'm relatively new to t2t and was studying how to leverage it for ASR when I came across your work.
Amazing work by @mohitsshah, with a proper explanation over at at16k. The results are pretty impressive.
I'…
-
Hi,
the link to Meta-World in the README is broken. Do you have another reference to the version you used? Or will I be able to reproduce your results by cloning Meta-World from the master branch o…
-
Hello,
Thank you for your work. I am interested in your Adafactor implementation. I want to use the same training hyperparameters as PEGASUS (https://arxiv.org/pdf/1912.08777.pdf) to train my mode…
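In case it helps, a minimal sketch of how I would select Adafactor through t2t hparam overrides; the problem, schedule, and values below are placeholders, not the actual PEGASUS settings:
```bash
# Select the Adafactor optimizer with an inverse-square-root decay
# schedule; these choices are illustrative, not the PEGASUS ones.
t2t-trainer \
  --data_dir=$DATA_DIR \
  --output_dir=$TRAIN_DIR \
  --problem=summarize_cnn_dailymail32k \
  --model=transformer \
  --hparams_set=transformer_base \
  --hparams='optimizer=Adafactor,learning_rate_schedule=rsqrt_decay'
```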