Closed mengxuehu closed 5 years ago
According to this line, dropout is applied to the sequence of embeddings after they have been looked up in the embedding table.
I'm not sure this is right: it has the effect of randomly dropping individual elements of each token's embedding vector, rather than dropping whole embedding vectors (post lookup). Instead, whole vectors should be dropped along the sequence dimension.
Furthermore, I'm pretty sure the zero vector doesn't necessarily correspond to the unk token, since the embeddings are learned.
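To make the distinction concrete, here's a minimal sketch of the two behaviors (numpy is used just for illustration; the names `elementwise_dropout` and `word_dropout` are mine, not from this repo):

```python
import numpy as np

rng = np.random.default_rng(0)

def elementwise_dropout(emb, p, rng):
    # What applying standard dropout to the embedded sequence does:
    # each scalar in each vector is zeroed independently.
    mask = rng.random(emb.shape) >= p
    return emb * mask / (1.0 - p)

def word_dropout(emb, p, rng):
    # Drop whole embedding vectors along the sequence dimension:
    # one Bernoulli draw per token position, broadcast across dims.
    seq_len, dim = emb.shape
    mask = (rng.random((seq_len, 1)) >= p).astype(emb.dtype)
    return emb * mask  # each row is either all-zero or untouched

emb = rng.standard_normal((5, 4))
dropped = word_dropout(emb, 0.5, rng)
for row_in, row_out in zip(emb, dropped):
    assert np.all(row_out == 0) or np.allclose(row_in, row_out)
```

Note that even this whole-vector version zeroes dropped positions rather than substituting a learned unk embedding, which connects to the point above about the zero vector not being the unk token.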
Right, this is different from the paper. Unfortunately, I don't have time to fix it right now, but feel free to open a PR. If you do, please also keep the current dropout version as an option.
What about the testing phase? We need to disable dropout there, right? In the current implementation, dropout is still applied at test time.
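A minimal sketch of how frameworks typically gate dropout on a train/eval flag (the `WordDropout` class here is hypothetical, mirroring e.g. PyTorch's `self.training` convention, not this repo's code):

```python
import numpy as np

class WordDropout:
    # Toy dropout layer with a train/eval switch: dropout is a no-op
    # when `training` is False, which is what test time should use.
    def __init__(self, p, seed=0):
        self.p = p
        self.training = True
        self.rng = np.random.default_rng(seed)

    def __call__(self, emb):
        if not self.training:
            return emb  # identity at test time
        mask = (self.rng.random((emb.shape[0], 1)) >= self.p)
        return emb * mask

layer = WordDropout(p=0.5)
x = np.ones((6, 3))
layer.training = False  # eval mode: dropout disabled
assert np.array_equal(layer(x), x)
```

In PyTorch, calling `model.eval()` flips this flag for all submodules, so an `nn.Dropout` layer becomes the identity at test time.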
According to the paper, "We do this by randomly replacing some fraction of the conditioned-on word tokens with the generic unknown word token unk", but it seems this implementation is dropping out word embeddings instead.
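For comparison, the paper's variant can be sketched as replacing token ids before the embedding lookup, so dropped positions receive the learned unk embedding rather than a zero vector (the function name and `UNK_ID = 0` are assumptions for illustration; the real unk index depends on the vocabulary):

```python
import numpy as np

UNK_ID = 0  # hypothetical unk index; depends on the vocabulary

def unk_word_dropout(token_ids, p, rng, unk_id=UNK_ID):
    # Replace a fraction p of conditioned-on tokens with the unk id
    # *before* the embedding lookup, per the quoted passage.
    token_ids = np.asarray(token_ids)
    mask = rng.random(token_ids.shape) < p
    return np.where(mask, unk_id, token_ids)

rng = np.random.default_rng(0)
ids = np.array([5, 9, 2, 7, 3, 8])
dropped = unk_word_dropout(ids, 0.5, rng)
# each position is either unchanged or replaced by UNK_ID
assert all(d == o or d == UNK_ID for d, o in zip(dropped, ids))
```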