Making sure a good system is in place to pass trained models to application stages without needing to have separate systems in place for train vs test mode.
Modifying the launcher to include "test mode" code.
Figuring out a way to standardize the method of applying trained models to test data and getting results (we don't want a different system per model).
We need our pipeline to also be able to run against test data.