netsensei closed this 6 years ago
This is a non-issue.
We can already do this by using a YAML exporter in the pipeline configuration instead of a Solr or Datahub exporter. The YAML exporter writes to STDOUT by default, so its output can be redirected to a flat file on the command line.
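To illustrate the idea, here is a minimal, generic sketch in Python of a dry run that writes fixed records to STDOUT as YAML instead of pushing them to a sink. It is not the project's actual exporter or configuration: the function names (`fix`, `export_yaml`, `dry_run`), the trimming fix, and the sample record are all hypothetical, and it only handles flat records.

```python
import json
import sys


def fix(record):
    """Hypothetical stand-in for the pipeline's fix step: trims string values."""
    return {k: v.strip() if isinstance(v, str) else v for k, v in record.items()}


def export_yaml(records, out=None):
    """Write flat records as a stream of YAML documents (STDOUT by default)."""
    out = out if out is not None else sys.stdout
    for record in records:
        out.write("---\n")
        for key, value in record.items():
            # JSON scalars are valid YAML, so json.dumps takes care of quoting.
            out.write(f"{json.dumps(str(key))}: {json.dumps(value)}\n")


def dry_run(records, out=None):
    """Apply the fixes, then export locally; nothing is sent to Solr or the Datahub."""
    fixed = [fix(r) for r in records]
    export_yaml(fixed, out)
    return len(fixed)


if __name__ == "__main__":
    # Hypothetical sample record, for illustration only.
    dry_run([{"id": "1", "title": "  Example  "}])
```

Saved under an assumed name such as `dry_run.py`, the output could be captured for inspection with `python dry_run.py > records.yml`.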
Closing this.
It should be possible to do a dry run, which means processing the records but not exporting data to a data sink (Datahub, Solr).
Goal: test whether all records are fixed correctly before pushing them to the Datahub. This allows the data to be cleaned up front.