apache / airflow

Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
https://airflow.apache.org/
Apache License 2.0

Status of testing Providers that were prepared on September 08, 2023 #34218

Closed eladkal closed 1 year ago

eladkal commented 1 year ago


I have a kind request for all the contributors to the latest provider packages release. Could you please help us to test the RC versions of the providers?

The guidelines on how to test providers can be found in "Verify providers by contributors".

Let us know in the comments whether the issue is addressed.

These are the providers that require testing because substantial changes were introduced:

Provider airbyte: 3.3.2rc1

All users involved in the PRs: @Taragolis @fabiogra @vijay-jangir @darkag @okayhooni @moiseenkov @RyuSA @potiuk @yermalov-here @eladkal @bkossakowska @hussein-awala @pankajastro @adam133 @kristopherkane @pankajkoti @melugoyal @pierrejeambrun @JDarDagran @GeoffroyDFox @RNHTTR @mjsqu @dstandish @josh-fell @wolfdn @Lee-W

Committer

yermalov-here commented 1 year ago

33952 works as expected

Taragolis commented 1 year ago
melugoyal commented 1 year ago

33715 works as expected

JDarDagran commented 1 year ago

34122 works fine

33959 does not work properly, for DbtCloudRunJobOperator it raises:

[2023-09-08, 21:36:05 UTC] {base.py:152} ERROR - OpenLineage provider method failed to extract data from provider. 
Traceback (most recent call last):
  File "/usr/local/lib/python3.8/site-packages/airflow/providers/openlineage/extractors/base.py", line 137, in _get_openlineage_facets
    facets: OperatorLineage = get_facets_method(*args)
  File "/usr/local/lib/python3.8/site-packages/airflow/providers/dbt/cloud/operators/dbt.py", line 229, in get_openlineage_facets_on_complete
    return generate_openlineage_events_from_dbt_cloud_run(operator=self, task_instance=task_instance)
  File "/usr/local/lib/python3.8/site-packages/airflow/providers/dbt/cloud/utils/openlineage.py", line 60, in generate_openlineage_events_from_dbt_cloud_run
    run_id=operator.run_id, account_id=operator.account_id, include_related=["run_steps,job"]
AttributeError: 'DbtCloudRunJobOperator' object has no attribute 'run_id'

I'll raise an issue and try to solve this.
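For readers unfamiliar with this class of failure, here is a minimal sketch of how an `AttributeError` like the one above can arise and how a lineage helper could guard against it. Everything in it is hypothetical: `FakeRunJobOperator` and `describe_run` are illustrative stand-ins, not the real `DbtCloudRunJobOperator` or the OpenLineage helper, and the assumption that `run_id` is only assigned once a run has been triggered is mine, not something the traceback confirms. It is not the fix that was eventually merged.

```python
from typing import Any, Dict, Optional


class FakeRunJobOperator:
    """Hypothetical stand-in for an operator; NOT the real DbtCloudRunJobOperator."""

    def __init__(self, account_id: int) -> None:
        self.account_id = account_id
        # Assumption for illustration: run_id is deliberately not set here.

    def execute(self) -> None:
        # Assumption for illustration: run_id only exists after a run is triggered.
        self.run_id = 42


def describe_run(operator: Any) -> Optional[Dict[str, int]]:
    """Return run details, or None when the operator has no run_id yet."""
    # getattr with a default avoids the AttributeError seen in the traceback.
    run_id = getattr(operator, "run_id", None)
    if run_id is None:
        return None
    return {"run_id": run_id, "account_id": operator.account_id}
```

Calling `describe_run` before `execute()` returns `None` instead of raising, which lets the caller skip lineage extraction rather than log a failure.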

RNHTTR commented 1 year ago

33685 doesn't work for @task-decorated operators, but I don't think that's a big deal, since such tasks by themselves probably aren't that useful for lineage data. I think this one is fine. @mobuchowski what do you think?

mobuchowski commented 1 year ago

@RNHTTR what if you disable _PythonDecoratedOperator?

hussein-awala commented 1 year ago

I checked all my 101 changes, and they are all present in the RC.

Lee-W commented 1 year ago

Tested https://github.com/apache/airflow/pull/33825, https://github.com/apache/airflow/pull/33822, https://github.com/apache/airflow/pull/34098

Will we be able to include documentation changes like https://github.com/apache/airflow/pull/34104, https://github.com/apache/airflow/pull/34103, https://github.com/apache/airflow/pull/34102, https://github.com/apache/airflow/pull/34101, https://github.com/apache/airflow/pull/34097, https://github.com/apache/airflow/pull/34096, https://github.com/apache/airflow/pull/34095, https://github.com/apache/airflow/pull/34094, https://github.com/apache/airflow/pull/34074, https://github.com/apache/airflow/pull/34073?

pankajkoti commented 1 year ago

Tested my change https://github.com/apache/airflow/pull/34018 in the Google RC 10.8.0rc1. It works fine, but it also has a dependency on common-sql provider 1.7.2.rc1 for the change in same PR. If the common-sql provider is not updated then it fails. How do we handle cross-provider dependency bumps during releases? Is it handled automatically, or do we need a manual minimum dependency bump here in the Google RC to require common-sql>=1.7.2?

hussein-awala commented 1 year ago

> it also has a dependency on common-sql provider 1.7.2.rc1 for the change in same PR. If the common-sql provider is not updated then it fails.

IMHO, if the operator is broken until common-sql is upgraded to the latest version, then we should consider it a breaking change and fix it. The min version of common-sql in the google provider is 1.3.1; bumping it to 1.7.2 could fix the issue, and I think that's safe as we still use the same major version.

potiuk commented 1 year ago

Yep. We should bump min version of common-sql
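To make the minimum-version discussion concrete, here is a small stdlib-only sketch of the comparison involved. It is a simplification written for this thread, not how pip or Airflow's release tooling resolves dependencies: real tooling uses PEP 440 ordering (for example `packaging.version.Version`), and the helper names `version_key` and `satisfies_minimum` are hypothetical. It only understands plain releases and `rc` pre-releases.

```python
import re
from typing import Tuple

# Matches "1.7.2", "1.7.2rc1" and the non-normalized spelling "1.7.2.rc1".
_VERSION_RE = re.compile(r"^(\d+(?:\.\d+)*)(?:[._-]?rc(\d+))?$")


def version_key(version: str) -> Tuple[Tuple[int, ...], int, int]:
    """Build a sortable key: release numbers, then rc (0) before final (1)."""
    match = _VERSION_RE.match(version.strip())
    if match is None:
        raise ValueError(f"unsupported version string: {version!r}")
    release = [int(part) for part in match.group(1).split(".")]
    # Drop trailing zeros so that "1.7" and "1.7.0" compare equal.
    while release and release[-1] == 0:
        release.pop()
    rc = match.group(2)
    if rc is None:
        return (tuple(release), 1, 0)
    return (tuple(release), 0, int(rc))


def satisfies_minimum(installed: str, minimum: str) -> bool:
    """True when the installed version is at least the required minimum."""
    return version_key(installed) >= version_key(minimum)
```

With this sketch, `satisfies_minimum("1.3.1", "1.7.2")` is false, which is the situation where the Google change fails. A release candidate such as `1.7.2rc1` also sorts before the final `1.7.2`, so it would not satisfy a `>=1.7.2` pin under this ordering.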

pankajkoti commented 1 year ago

Thank you @hussein-awala and @potiuk for your quick inputs and suggestions. I have created a PR now to bump the min version https://github.com/apache/airflow/pull/34257.

cc: @eladkal Sorry, I didn't realise this earlier. What would be the steps for releasing the Google RC now, as it may depend on PR https://github.com/apache/airflow/pull/34257?

potiuk commented 1 year ago

Checked all my changes are in (mostly dependencies). All looks good.

eladkal commented 1 year ago

dbt.cloud will be excluded from this wave due to the issues found. Please keep testing the rest of the providers.

mobuchowski commented 1 year ago

https://github.com/apache/airflow/pull/33959 bug does not affect OpenLineage provider, just dbt one - we should only exclude this. I will doublecheck OL provider.

EDIT: https://github.com/apache/airflow/pull/34270 should fix this issue, maybe release from RC2?

moiseenkov commented 1 year ago

34068 works fine

darkag commented 1 year ago

34041 works as expected

josh-fell commented 1 year ago

34106 and #34130 look good!

eladkal commented 1 year ago

> 33959 bug does not affect OpenLineage provider, just dbt one - we should only exclude this. I will doublecheck OL provider.
>
> EDIT: #34270 should fix this issue, maybe release from RC2?

I will cut rc2 for dbt.cloud

eladkal commented 1 year ago

Thank you everyone. dbt.cloud is excluded from this release. The rest of the providers are released. I invite everyone to help improve providers for the next release; a list of open issues can be found here.