nyu-mll / jiant-v1-legacy

The jiant toolkit for general-purpose text understanding models
MIT License

Expose intermediate repr, attention weight, and other meaningful tensors #910

Open jeswan opened 4 years ago

jeswan commented 4 years ago

Issue by HaokunLiu Sunday Sep 08, 2019 at 18:19 GMT Originally opened as https://github.com/nyu-mll/jiant/issues/910


We may want to save certain tensors from the model to a file for further study. This is needed for error analysis and many other analysis methods, but there are a few things we need to think through.

Saving the entire computation graph, the full training process, or, in extreme cases, the full dataset would consume an enormous amount of storage. We therefore need a flexible way to select what we are interested in along all three dimensions: which tensors, which training steps, and which examples.

Ideally, a user would be able to select these in the config file, without needing to modify the code.

I don't have a clear idea of how we should design this, especially the computation-graph part.
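One possible direction for the "which tensors" part is PyTorch forward hooks, selected by module name from the config. Below is a minimal sketch, not jiant's actual API: the helper name, the module-name list, and the file layout are all illustrative, and the training-step / example selection would still need separate filtering on the trainer side.

```python
import os
import torch


def register_activation_saving(model, module_names, output_dir):
    """Attach forward hooks that dump the outputs of selected submodules to disk.

    module_names: dotted names as reported by model.named_modules().
    Only those modules are captured, which keeps storage manageable
    compared to saving everything in the computation graph.
    """
    os.makedirs(output_dir, exist_ok=True)
    handles = []
    call_counts = {name: 0 for name in module_names}

    def make_hook(name):
        def hook(module, inputs, output):
            # Some modules return tuples; grab the first element in that case.
            tensor = output[0] if isinstance(output, (tuple, list)) else output
            path = os.path.join(
                output_dir, f"{name.replace('.', '_')}_{call_counts[name]:06d}.pt"
            )
            # Detach and move to CPU so we store values, not the autograd graph.
            torch.save(tensor.detach().cpu(), path)
            call_counts[name] += 1
        return hook

    for name, module in model.named_modules():
        if name in module_names:
            handles.append(module.register_forward_hook(make_hook(name)))
    # Caller can invoke h.remove() on each handle to stop capturing.
    return handles
```

If something like this were wired into jiant, the module-name list could come from a new config field and the helper could be called right after the model is built, so no model code would need to change.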

jeswan commented 4 years ago

Comment by sleepinyourhat Sunday Sep 08, 2019 at 21:02 GMT


I'm open to good ideas here, but I think the lazy default is simply to take advantage of what makes PyTorch so distinctively useful: it's easy to load a trained model from disk, insert print/save code into the model, and then run it.
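For reference, a rough sketch of that workflow using a forward hook instead of editing `forward()` directly. The `build_model()` call, the checkpoint path, and the `sent_encoder` module name are placeholders for whatever the experiment actually uses:

```python
import torch

# Placeholders: build_model() stands in for however the experiment script
# reconstructs the architecture, and the checkpoint path is illustrative.
model = build_model()
model.load_state_dict(
    torch.load("runs/my_experiment/model_state.th", map_location="cpu")
)
model.eval()

captured = {}


def grab(module, inputs, output):
    # Stash the output for offline inspection; handle tuple-returning modules.
    tensor = output[0] if isinstance(output, (tuple, list)) else output
    captured["encoder_output"] = tensor.detach().cpu()


# Pick whichever submodule is of interest via model.named_modules().
handle = dict(model.named_modules())["sent_encoder"].register_forward_hook(grab)
# ... run the model on a few batches as usual ...
handle.remove()
torch.save(captured, "encoder_output_dump.pt")
```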