microsoft / data-factory-testing-framework

A stand-alone test framework that allows to write unit tests for Data Factory pipelines on Microsoft Fabric, Azure Data Factory and Azure Synapse Analytics.
MIT License
92 stars 21 forks source link

DataFlow Activity Support #140

Open deenairn opened 2 weeks ago

deenairn commented 2 weeks ago

The framework looks like it does a great job of testing individual activities in Pipelines.

Feature Request However, the ADF DataFlow Activity can have some quite complex transformations in place, and it would be great if there were a way to substitute the DataFlow data source for a fixed string, and then test that the transformation against this (JSON for JSON data type, array of arrays for delimited text or database types, XML for XML data types, etc) so you can provide a reasonable unit test for complex transformations of data.

i.e. for a DataFlow that takes in a JSON data type in Blob Storage and outputs to a Delimited Text in Blob Storage, you can do something like

setup activity with simple JSON like:

[ { "name": "Donald", "age": 21 }]

assert that it returns what you expect via checking against a CSV like

Name,Age
Donald,21