Data generator prep take two

This PR separates updating model state from creating data arrays. The main changes are in Trainer and TextTrainer; everything else is just a consequence of that API change.

The point is to give us a simpler interface for getting data arrays, which is then easier to swap out for a data generator. In this design we still assume the whole dataset fits in memory, both as plain text and as indexed instances. What changes is padding: with a generator, we no longer have to pad over the whole dataset at once. The generator itself isn't implemented in this PR; that'll be the next one.
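To make the padding difference concrete, here is a rough sketch. It is not the code in this PR: the names `pad`, `create_data_arrays`, and `batch_generator` are hypothetical, and it assumes an indexed instance is just a list of token ids and uses plain lists in place of real arrays.

```python
from typing import Iterator, List

# Hypothetical stand-in: one indexed instance is a list of token ids.
Instance = List[int]


def pad(instances: List[Instance], length: int, pad_id: int = 0) -> List[List[int]]:
    """Right-pad (or truncate) every instance to exactly `length` ids."""
    return [inst[:length] + [pad_id] * (length - len(inst)) for inst in instances]


def create_data_arrays(instances: List[Instance]) -> List[List[int]]:
    """Whole-dataset path: pad everything to the longest instance overall."""
    if not instances:
        return []
    return pad(instances, max(len(inst) for inst in instances))


def batch_generator(instances: List[Instance], batch_size: int) -> Iterator[List[List[int]]]:
    """Generator path: instances stay in memory, but each batch is padded
    only to the longest instance in that batch."""
    for start in range(0, len(instances), batch_size):
        batch = instances[start:start + batch_size]
        yield pad(batch, max(len(inst) for inst in batch))


instances = [[1, 2], [3], [4, 5, 6, 7], [8]]
whole = create_data_arrays(instances)
batches = list(batch_generator(instances, batch_size=2))
```

In this sketch the whole-dataset path pads every instance to length 4, while the first generated batch only needs length 2. Both functions take the same already-indexed instances, which is the kind of narrow interface that makes the swap straightforward.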

allenai/deep_qa#293