What
The pipeline-config.json is being replaced with a dictionary (CONFIGURATION in dpypelines/pipeline/configuration.py) containing the details required to run a transform on a specific dataset. These details will be accessed via a dataset_id, which is used as the CONFIGURATION dictionary key (actually a regex pattern that matches the dataset_id).
The dataset_id should be specified as part of the input metadata. This value can then be passed to the relevant dataset_ingress function (CONFIGURATION[secondary_function]), with the correct configuration details.

How to review
Make sure you understand how the values in CONFIGURATION will be used to configure a pipeline, and that all fields required to run dataset_ingress_v1() are present.

Who can review
Anyone.
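
For reviewers who want a concrete picture of the regex-keyed lookup described under "What", here is a minimal sketch. It is not the code in dpypelines/pipeline/configuration.py: the pattern, the "transform" field, the helper names get_dataset_config and run_ingress, and the body of dataset_ingress_v1 are all hypothetical stand-ins. It also assumes that secondary_function is a field of each entry that holds the ingress function, which may not match the real layout.

```python
import re


def dataset_ingress_v1(dataset_id, config):
    # Hypothetical stand-in for the real ingress function; it only reports
    # which transform it would have run.
    return f"ingested {dataset_id} with transform {config['transform']}"


# Hypothetical CONFIGURATION: each key is a regex pattern that is matched
# against the dataset_id; each value holds the details for that dataset.
CONFIGURATION = {
    r"^example_dataset_\d+$": {
        "transform": "example_transform",  # assumed field name
        "secondary_function": dataset_ingress_v1,  # assumed field layout
    },
}


def get_dataset_config(dataset_id, configuration=CONFIGURATION):
    """Return the details of the single entry whose pattern matches dataset_id."""
    matches = [
        details
        for pattern, details in configuration.items()
        if re.fullmatch(pattern, dataset_id)
    ]
    if len(matches) != 1:
        raise ValueError(
            f"expected exactly one match for {dataset_id!r}, found {len(matches)}"
        )
    return matches[0]


def run_ingress(metadata):
    """Read dataset_id from the input metadata and call the configured function."""
    dataset_id = metadata["dataset_id"]
    config = get_dataset_config(dataset_id)
    return config["secondary_function"](dataset_id, config)
```

Raising an error when zero or several patterns match is a design choice of this sketch, not something the PR states; it keeps an ambiguous dataset_id from silently picking the wrong configuration.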