-
## Paper link
https://arxiv.org/abs/1508.07909
## Publication date (yyyy/mm/dd)
2015/08/31
## Summary
A paper proposing that, to handle rare words in machine translation models, the minimal unit of processing be sub-word units smaller than whole words.
Rare words include, for example, compound words; viewed at the word level these are indeed ra…
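To make the idea concrete, here is a minimal sketch of the BPE merge-learning loop, close in spirit to the code listing in the paper: words are split into characters, and the most frequent adjacent symbol pair is repeatedly merged into a new symbol. The toy vocabulary and the number of merges are illustrative assumptions.

```
import re
import collections

def get_stats(vocab):
    """Count frequencies of adjacent symbol pairs in the vocabulary."""
    pairs = collections.defaultdict(int)
    for word, freq in vocab.items():
        symbols = word.split()
        for i in range(len(symbols) - 1):
            pairs[symbols[i], symbols[i + 1]] += freq
    return pairs

def merge_vocab(pair, v_in):
    """Merge the most frequent pair into a single symbol everywhere."""
    v_out = {}
    bigram = re.escape(' '.join(pair))
    p = re.compile(r'(?<!\S)' + bigram + r'(?!\S)')
    for word in v_in:
        v_out[p.sub(''.join(pair), word)] = v_in[word]
    return v_out

# Toy vocabulary: words split into characters, '</w>' marks end of word.
vocab = {'l o w </w>': 5, 'l o w e r </w>': 2,
         'n e w e s t </w>': 6, 'w i d e s t </w>': 3}
num_merges = 10  # illustrative; real systems use tens of thousands
for _ in range(num_merges):
    pairs = get_stats(vocab)
    if not pairs:
        break
    best = max(pairs, key=pairs.get)
    vocab = merge_vocab(best, vocab)
    print(best)
```

Because merges are learned from frequency statistics, frequent words survive as single symbols while rare words decompose into known sub-word units.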
-
# BPE as input tokens of the transformer model
The Transformer model proposed in "_Attention Is All You Need_" encodes the 4.5M-sentence input data into a small vocabulary generated by learning sha…
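At tokenization time, the learned merges are replayed in order on each new word so that test-time segmentation matches training-time segmentation. A minimal sketch, assuming a hypothetical merge list (in practice the list would be learned jointly on source and target text, giving the shared vocabulary mentioned above):

```
def apply_bpe(word, merges):
    """Segment one word using a learned, ordered list of BPE merges."""
    symbols = list(word) + ['</w>']
    for a, b in merges:
        i = 0
        while i < len(symbols) - 1:
            if symbols[i] == a and symbols[i + 1] == b:
                symbols[i:i + 2] = [a + b]  # apply this merge in place
            else:
                i += 1
    return symbols

# Hypothetical merges learned from a shared source-target corpus:
merges = [('e', 's'), ('es', 't'), ('est', '</w>'), ('l', 'o'), ('lo', 'w')]
print(apply_bpe('lowest', merges))  # ['low', 'est</w>']
```

The resulting sub-word tokens are what the Transformer consumes as its input symbols.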
-
I've noticed in several of my experiments that xnmt is generating "s" even when using sentencepiece, which is strange. This issue is a reminder to myself to check why this is happening and see if it's…
-
Neural Machine Translation of Rare Words with Subword Units
dropout
batch normalization
layer norm
-
### Feature Description
As of today, the standard way to build a vocabulary of (usually subword) units for text inputs is to pretrain a model capable of generating a list of the most adequate subword units f…
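A minimal sketch of that pretrain-then-tokenize workflow using the sentencepiece library (mentioned elsewhere in these notes); the corpus file, model prefix, and vocabulary size are illustrative placeholders:

```
import sentencepiece as spm

# Train a subword model on raw text, one sentence per line (placeholder file).
spm.SentencePieceTrainer.train(
    input='corpus.txt',
    model_prefix='subword',  # writes subword.model and subword.vocab
    vocab_size=8000,         # illustrative size
    model_type='bpe',        # BPE as in the paper; 'unigram' is the default
)

# Load the pretrained model and segment new text into subword units.
sp = spm.SentencePieceProcessor(model_file='subword.model')
print(sp.encode('the widest river', out_type=str))
```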
-
link: http://repository.cmu.edu/lti/48/
referenced from:
- Neural Machine Translation of Rare Words with Subword Units #79 (for a bilingual dictionary based on fast-align)
-
# Requirements
- [ ] Read about the input embedding technique (byte-pair encoding) used by Google's team in the "Attention Is All You Need" paper.
- [ ] Design the input embedding pipeline for **wmt 2014 e…
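For the second checklist item, a hypothetical sketch of the pipeline's core step, BPE tokens mapped to integer ids and then to embedding vectors; the toy vocabulary, dimensions, and tokens are illustrative (the real pipeline would use the WMT 2014 data, and the paper uses d_model = 512):

```
import numpy as np

vocab = {'<pad>': 0, '<unk>': 1, 'low': 2, 'est</w>': 3}  # toy BPE vocab
d_model = 8
rng = np.random.default_rng(0)
embedding = rng.normal(size=(len(vocab), d_model))

tokens = ['low', 'est</w>']
ids = [vocab.get(t, vocab['<unk>']) for t in tokens]
vectors = embedding[ids] * np.sqrt(d_model)  # embedding scaling from the paper
print(vectors.shape)  # (2, 8)
```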
-
Hi,
I am trying to use subword units with the Kaldi librispeech recipe. I have used the code snippet mentioned in the README in stage 3 of the librispeech recipe.
```
if [ $stage -le 3 ]; then
…
```
-
## 0. Paper
@inproceedings{xu-etal-2019-treat,
title = "Treat the Word As a Whole or Look Inside? Subword Embeddings Model Language Change and Typology",
author = "Xu, Yang and
Zhan…
-
Everyone, please pick a small topic of your own~