mjc92 / CopyNet

An implementation of "Incorporating copying mechanism in sequence-to-sequence learning"

Where are the correct models #1

Open mojesty opened 6 years ago

mojesty commented 6 years ago

Hello! Thanks for open-sourcing an implementation of this paper! I can't tell which file contains the correct implementation: models/copynet.py or models/copynet_dbg.py. The former is older, but I'm still not sure. Thanks!

mojesty commented 6 years ago

Also, I have some questions about the implementation itself. In both files the decoder returns a tensor of shape [batch_size, vocab_size], so it's not clear to me how the "pure copy" mode (when the word is in the source sequence but not in the vocabulary) can be enabled. I also wonder why you use argmax in the testing phase: it usually makes models collapse into outputting just "the the the" or "unk unk unk".
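
To make the "pure copy" question concrete, here is a minimal sketch of how I understand the paper's idea. It is not code from this repository. The output distribution is taken over the vocabulary plus the out-of-vocabulary (OOV) words of the current source sentence, so its size is vocab_size + num_oov rather than vocab_size. Generate scores and copy scores share a single softmax, and each source position adds its copy probability to the id of the word at that position. All names here (`extend_ids`, `copynet_distribution`, `sample_id`) and the toy scores are hypothetical, and plain Python lists stand in for tensors.

```python
import math
import random


def extend_ids(source_tokens, vocab):
    """Map source tokens to ids; OOV words get temporary ids >= len(vocab)."""
    oov_words = []
    ids = []
    for tok in source_tokens:
        if tok in vocab:
            ids.append(vocab[tok])
        else:
            if tok not in oov_words:
                oov_words.append(tok)
            ids.append(len(vocab) + oov_words.index(tok))
    return ids, oov_words


def copynet_distribution(gen_scores, copy_scores, source_ext_ids, num_oov):
    """Joint softmax over generate scores (one per vocabulary word) and
    copy scores (one per source position), summed per extended word id."""
    shift = max(gen_scores + copy_scores)  # subtract max for numerical stability
    gen_exp = [math.exp(s - shift) for s in gen_scores]
    copy_exp = [math.exp(s - shift) for s in copy_scores]
    z = sum(gen_exp) + sum(copy_exp)
    # Generate mode covers the fixed vocabulary; OOV slots start at zero.
    probs = [g / z for g in gen_exp] + [0.0] * num_oov
    # Copy mode adds mass to whichever word sits at each source position.
    for pos, ext_id in enumerate(source_ext_ids):
        probs[ext_id] += copy_exp[pos] / z
    return probs


def sample_id(probs, rng):
    """Draw one id from the distribution instead of taking the argmax."""
    return rng.choices(range(len(probs)), weights=probs, k=1)[0]


if __name__ == "__main__":
    vocab = {"<unk>": 0, "the": 1, "cat": 2}  # toy vocabulary (assumption)
    source = ["the", "zyx", "cat"]            # "zyx" is not in the vocabulary
    ids, oov = extend_ids(source, vocab)
    probs = copynet_distribution([0.0, 0.0, 0.0], [0.0, 2.0, 0.0], ids, len(oov))
    id_to_word = {i: w for w, i in vocab.items()}
    id_to_word.update({len(vocab) + k: w for k, w in enumerate(oov)})
    greedy = max(range(len(probs)), key=probs.__getitem__)
    print("greedy:", id_to_word[greedy])
    print("sampled:", id_to_word[sample_id(probs, random.Random(0))])
```

With a decoder output of this extended size, an id at or above vocab_size can only come from copying. If the output stays at [batch_size, vocab_size], I don't see where that probability mass could go. The `sample_id` helper is only there to show one alternative to argmax at test time; I'm not claiming it is the better choice.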