BPE-Dropout question - Githubissues

rsennrich / subword-nmt

Unsupervised Word Segmentation for Neural Machine Translation and Text Generation

MIT License

2.18k stars 464 forks source link

Hi,

I'm trying to implement BPE dropout using the tecnique you mention in the README, by creating an augmented training dataset by concatenating the original training (5K sentences) dataset multiple times, and then applying BPE dropout on this. I'm just wondering do I have to apply the "learn BPE" method on the concatenated dataset or does it suffice to learn BPE on the original 5K dataset, and then to simply apply BPE with the dropout probability on the concatenated dataset using the vocabulary learned on the original dataset?

rsennrich / subword-nmt

BPE-Dropout question #104