Closed cliu587 closed 8 years ago
@sb2nov do you know why we set `output_node=base_output_node` at https://github.com/coursera/dataduct/pull/179/files#diff-59074e91ee415f9f629abf53692c99b4L114? It seems `self._output` is potentially different from `base_output_node`, as per the computation in L103.
If we used self._output, it would create multiple staging directories. What we want instead is a single staging directory that gets mapped to multiple nodes based on subdirectories, so that the command doesn't need to figure out which staging directory maps to which output. That is also easier to manage.
Let me know if you want more details on it.
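To make the idea concrete, here is a rough sketch of the "one staging directory, many subdirectories" mapping. It is not the dataduct code: the function name `map_outputs_to_subdirs`, the bucket path, and the output names are all hypothetical and only illustrate the layout.

```python
import posixpath


def map_outputs_to_subdirs(base_staging_dir, output_names):
    """Map each named output to a subdirectory of one shared staging directory.

    Hypothetical helper for illustration only; not part of dataduct.
    """
    return {
        name: posixpath.join(base_staging_dir, name)
        for name in output_names
    }


# Assumed example values: a single base staging directory and two outputs.
outputs = map_outputs_to_subdirs(
    "s3://example-bucket/staging/step_1", ["users", "orders"]
)
```

With this layout the command only needs to know the base directory and write into a subdirectory named after each output, rather than tracking a separate staging directory per output node.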
LGTM though
PTAL @sb2nov, @darinyu-coursera. Will land after the new load_reload_pk step is tested with this diff.