I have a dependency problem with a change I'm making to our Dataform pipeline. I'm adding a reference to a new table gcp_costs_excluded. This new table is a static BigQuery table, which we refer to in the pipeline with a declaration config. We use the same pattern for other source tables.
If I refer to this with the following simple intermediate table, it works fine:
SELECT
day,
sku,
FROM ${ref("gcp_costs_excluded")}
However, I want to add the following to an existing intermediate table - in order to filter out any rows that match the rows in the exclude table:
LEFT JOIN ${ref("gcp_costs_excluded")} exclude_list
ON simplified.sku = exclude_list.sku
AND TIMESTAMP_TRUNC(simplified.timestamp_start, DAY) = exclude_list.day
WHERE exclude_list.sku IS NULL
When I run the pipeline now I get the following error:
Error: Input for dataset "{"name":"gcp_costs_excluded"}" has not been provided. Provided inputs: {"name":"gcp_combined_billing_export"},{"name":"gcp_resource_categories"}
I have tried other variations of the sql snippet without luck. I have also tried adding the upstream tables as an dependency to the downstream table config. I am using Dataform version 2.9.0.
I have a dependency problem with a change I'm making to our Dataform pipeline. I'm adding a reference to a new table
gcp_costs_excluded
. This new table is a static BigQuery table, which we refer to in the pipeline with adeclaration
config. We use the same pattern for other source tables.If I refer to this with the following simple intermediate table, it works fine:
However, I want to add the following to an existing intermediate table - in order to filter out any rows that match the rows in the
exclude
table:When I run the pipeline now I get the following error:
I have tried other variations of the sql snippet without luck. I have also tried adding the upstream tables as an dependency to the downstream table config. I am using Dataform version 2.9.0.
Any help would be appreciated!