This request adds an implementation of guided alignment training as described in Guided Alignment Training for Topic-Aware Neural Machine Translation (Chen et al., 2016). In summary (a sketch of each piece follows the list):

preprocess.py: optionally stores source-to-target alignments in sparse format (-alignfile, -alignvalfile)
s2sa/data.lua: if present, loads the alignments and converts them into dense format per batch
train.lua: optionally creates a parallel criterion combining the decoder criterion (ClassNLLCriterion) with the guided-alignment criterion (MSECriterion) (-guided_alignment, -guided_alignment_weight, -guided_alignment_decay)
s2sa/models.lua: optionally exposes attention output in the decoder model
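For the s2sa/data.lua side, here is a minimal sketch of the per-batch densification, assuming the alignments arrive as {target_idx, source_idx} link pairs from preprocessing; the helper name `to_dense` and the row normalization are illustrative, not necessarily what the PR does:

```lua
require 'torch'

-- hypothetical helper: expand sparse alignment links for one sentence
-- into a dense tgt_len x src_len matrix matching the attention shape
local function to_dense(links, tgt_len, src_len)
  local align = torch.zeros(tgt_len, src_len)
  for _, l in ipairs(links) do
    align[l[1]][l[2]] = 1
  end
  -- normalize each target row into a distribution over source words
  for t = 1, tgt_len do
    local s = align[t]:sum()
    if s > 0 then align[t]:div(s) end
  end
  return align
end

-- e.g. "das Haus" -> "the house" aligned 1-1 and 2-2:
local dense = to_dense({{1, 1}, {2, 2}}, 2, 2)
```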
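For the combined objective in train.lua, a minimal sketch using nn.ParallelCriterion for one decoder timestep; the weighting shown here (NLL at weight 1, MSE scaled by the alignment weight) is an assumption about how the flags are wired up, and the decay schedule is only indicated in a comment:

```lua
require 'nn'

-- assumed weighting: alignment term scaled by -guided_alignment_weight,
-- which would be multiplied by -guided_alignment_decay after each epoch
local guided_alignment_weight = 0.5

local criterion = nn.ParallelCriterion()
criterion:add(nn.ClassNLLCriterion(), 1)
criterion:add(nn.MSECriterion(), guided_alignment_weight)

-- forward takes {decoder_log_probs, attention_weights} against
-- {gold_word_indices, dense_gold_alignments} for one timestep
local batch, vocab, src_len = 2, 10, 7
local log_probs = nn.LogSoftMax():forward(torch.randn(batch, vocab))
local attn      = nn.SoftMax():forward(torch.randn(batch, src_len))
local targets   = torch.LongTensor(batch):random(vocab)
local gold      = nn.SoftMax():forward(torch.randn(batch, src_len))

local loss = criterion:forward({log_probs, attn}, {targets, gold})
```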
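And the s2sa/models.lua change amounts to returning the attention softmax as an extra graph output so the criterion above can reach it. A toy, self-contained nngraph illustration; the dot-product attention and all names here are placeholders for the real decoder:

```lua
require 'nn'
require 'nngraph'

local rnn_size, src_len = 4, 7
local hidden  = nn.Identity()()   -- batch x rnn_size
local context = nn.Identity()()   -- batch x src_len x rnn_size

-- toy dot-product attention scores: batch x src_len x 1
local scores = nn.MM(false, false)({context,
  nn.View(rnn_size, 1):setNumInputDims(1)(hidden)})
local attn = nn.SoftMax()(nn.Squeeze(3)(scores))            -- batch x src_len
local ctx  = nn.Squeeze(2)(nn.MM(false, false)({
  nn.View(1, -1):setNumInputDims(1)(attn), context}))       -- batch x rnn_size

-- the attention distribution becomes a second module output
local decoder = nn.gModule({hidden, context}, {ctx, attn})

local out = decoder:forward({torch.randn(2, rnn_size),
                             torch.randn(2, src_len, rnn_size)})
-- out[1]: context vector, out[2]: attention weights for the criterion
```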
Cool! Also relevant:
http://arxiv.org/pdf/1609.04186.pdf
(the above work claims that cross entropy does slightly better than MSE for training the attention part of the model)
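If someone wanted to try that variant here, a cross-entropy-style attention loss can be approximated in Torch with nn.DistKLDivCriterion, since KL divergence to a fixed target differs from cross entropy only by the target's (constant) entropy; a sketch, with all tensors illustrative:

```lua
require 'nn'

-- nn.DistKLDivCriterion expects log-probabilities as input and a target
-- distribution; its gradients match a cross-entropy objective
local attn_criterion = nn.DistKLDivCriterion()

local attn = torch.rand(2, 7)
attn:cdiv(attn:sum(2):expandAs(attn))   -- normalize rows to distributions
local gold = torch.rand(2, 7)
gold:cdiv(gold:sum(2):expandAs(gold))

local loss = attn_criterion:forward(torch.log(attn), gold)
```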