-
I have trained a seq2seq NMT model (EN-DE) with 1M samples and saved the latest checkpoint. Now, I have some domain-specific data of 50K sentence pairs which has **not** been seen in previous training…
-
## In a nutshell
They reformulate the translation objective as an expectation over subword tokenizations, and improve accuracy by sampling tokenizations during NMT training. The sampling plays a role similar to regularization and data augmentation. To sample tokenizations, they need a probabilistic rather than deterministic treatment, which is provided by a unigram language m…
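A minimal sketch of what this tokenization sampling can look like with the SentencePiece Python API (the model file name is a placeholder; `enable_sampling`, `alpha`, and `nbest_size` control the sampling):

```python
import sentencepiece as spm

# Load a unigram LM tokenizer (probabilistic, unlike deterministic BPE merges).
sp = spm.SentencePieceProcessor(model_file="unigram.model")  # placeholder path

sentence = "subword regularization samples a different segmentation each epoch"

# Deterministic (Viterbi) segmentation: always the same pieces.
print(sp.encode(sentence, out_type=str))

# Sampled segmentations: draw from the segmentation lattice (nbest_size=-1
# samples from all hypotheses); alpha sharpens or flattens the distribution.
for _ in range(3):
    print(sp.encode(sentence, out_type=str,
                    enable_sampling=True, alpha=0.1, nbest_size=-1))
```

Feeding a freshly sampled segmentation of each sentence at every epoch is what gives the regularization / data-augmentation effect described above.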
-
The vast majority of time during training is spent in the dot product and scaled additions. We have been doing unaligned loads so far. I have made a quick modification that ensures that every embeddin…
-
### Metadata
- Authors: Rico Sennrich and Barry Haddow
- Organization: School of Informatics, University of Edinburgh
- Conference: WMT 2016
- Link: https://goo.gl/jqYQ8r
-
Hi, thanks for the great work!
I tried to run the code, but I don't know how to do the data preprocessing for the AMR corpus. May I ask how I can do the data preprocessing?
-
I am currently training a transformer model and have followed the MTM labs to apply BPE to my own corpus. However, I'm unsure of the effect that providing a pre-determined vocabulary has. Does it impa…
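If the labs use the subword-nmt toolkit, the vocabulary filter works roughly as follows: subwords that are absent from the supplied vocabulary (or fall below the frequency threshold) are broken down further into smaller units at application time, so the model only ever sees segments from the pre-determined vocabulary. A small sketch with placeholder file names and threshold:

```python
from subword_nmt.apply_bpe import BPE, read_vocabulary

# Placeholder paths: learned BPE merge operations and a "word count" vocabulary
# extracted from the corpus the codes will be applied to.
with open("codes.bpe", encoding="utf-8") as codes_file, \
     open("vocab.txt", encoding="utf-8") as vocab_file:
    # Only subwords occurring at least 50 times are kept as units; rarer ones
    # are split further when the BPE codes are applied.
    vocab = read_vocabulary(vocab_file, threshold=50)
    bpe = BPE(codes_file, vocab=vocab)

print(bpe.process_line("a sample sentence to segment"))
```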
-
Hello,
With a vocabulary size of 55K, I have trained the model for 200K steps and saved the latest checkpoint.
Now I have increased my vocabulary size to 70K.
1. How can I continue training from the …
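One common way to handle this (not tied to any particular toolkit, and assuming the first 55K entries keep the same indices in the new vocabulary) is to copy the old embedding rows into a larger matrix and let the 15K new rows start from a fresh initialization; a minimal PyTorch sketch with hypothetical sizes:

```python
import torch
import torch.nn as nn

OLD_VOCAB, NEW_VOCAB, DIM = 55_000, 70_000, 512  # hypothetical dimensions

# Old embedding weights, as they would be restored from the saved checkpoint.
old_emb = nn.Embedding(OLD_VOCAB, DIM)

# New, larger embedding: copy the rows that already exist, keep the remaining
# 15K rows at their fresh random initialization.
new_emb = nn.Embedding(NEW_VOCAB, DIM)
with torch.no_grad():
    new_emb.weight[:OLD_VOCAB] = old_emb.weight

# Swap the resized embedding (and the matching output projection) into the
# model, then resume training from the checkpointed optimizer state or restart
# the optimizer, depending on what the toolkit supports.
```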
-
For continuation words, there are varying numbers of # signs. For example, in the first 5 words we have the following:
- /c/de/####er
- /c/de/###er
- /c/de/##er
For example, if I have a word endi…
-
There is at least one unusable vocabulary entry in our gabert vocab, namely `##-"`. Find all entries that BERT will never use, since BERT first splits around all non-alphanumeric characters without ap…
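A rough sketch of such a filter, under the stated assumption that a continuation piece containing any non-alphanumeric character is unreachable (the vocab file name is a placeholder, and the check ignores finer details such as CJK character splitting):

```python
def unusable_entries(vocab_path="gabert-vocab.txt"):
    """Collect ##-prefixed entries that contain a non-alphanumeric character.

    Because the BasicTokenizer splits punctuation into separate tokens before
    WordPiece runs, such continuation pieces can never be produced.
    """
    bad = []
    with open(vocab_path, encoding="utf-8") as f:
        for line in f:
            token = line.rstrip("\n")
            if token.startswith("##"):
                body = token[2:]
                if body and not body.isalnum():
                    bad.append(token)
    return bad

print(unusable_entries())
```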
-
I am trying to train a Chinese Conformer model. When I train with 4 × 2080 Ti GPUs, an error occurs in the middle of an epoch: CUDA_ERROR_ILLEGAL_ADDRESS: an illegal memory access was encountered…