OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team collaboration.
Describe the bug
running lineage ingestion from OM UI using airflow.
Running the lineage using SDK.
workflow = MetadataWorkflow.create(workflow_config_lineage)
Expected behavior
airflow is able to get the kineage required. but SDK is not.
I am not having any error msg while ingesting the data.
Yet after successful completion of lineage ingestion I do not find any lineage information.
Cannot see lineage while using SDK
[2024-09-23 15:39:55] INFO {metadata.Ingestion:lineage_source:75} - Scanning query logs for 2024-09-08 - 2024-09-24
[2024-09-23 15:40:00] DEBUG {metadata.Utils:sql_lineage:393} - Running lineage with query: INSERT INTO `rightdata-1376`.rddatasets.Client_Region_Data (
Client_ID,
Client_Name,
Client_Address,
Client_Phone,
Client_Balance,
Client_Segment,
Nation_Name,
Region_Name,
Comment
)
SELECT
C.C_CUSTKEY AS Client_ID,
C.C_NAME AS Client_Name,
C.C_ADDRESS AS Client_Address,
C.C_PHONE AS Client_Phone,
C.C_ACCTBAL AS Client_Balance,
C.C_MKTSEGMENT AS Client_Segment,
N.N_NAME AS Nation_Name,
R.R_NAME AS Region_Name,
C.C_COMMENT AS Comment
FROM `rightdata-1376`.tpch_1.Customer C
JOIN `rightdata-1376`.tpch_1.Nation N
ON C.C_NATIONKEY = N.N_NATIONKEY
JOIN `rightdata-1376`.tpch_1.Region R
ON N.N_REGIONKEY = R.R_REGIONKEY
[2024-09-23 15:40:03] DEBUG {metadata.Utils:sql_lineage:393} - Running lineage with query: INSERT INTO `rightdata-1376`.STAGE_Finance.Client (
Client_ID,
Client_Name,
Client_Type,
Client_Industry,
Client_Region,
Client_Segment
)
SELECT
C_CUSTKEY AS Client_ID,
C_NAME AS Client_Name,
-- Assuming you don't have exact data for the following columns, you can set them as NULL or use placeholders
'Unknown' AS Client_Type,
'Unknown' AS Client_Industry,
'Unknown' AS Client_Region,
C_MKTSEGMENT AS Client_Segment
FROM `rightdata-1376`.tpch_1.Customer
[2024-09-23 15:40:04] DEBUG {metadata.Utils:sql_lineage:393} - Running lineage with query: INSERT INTO `rightdata-1376`.STAGE_Finance.Client (
Client_ID,
Client_Name,
Client_Type,
Client_Industry,
Client_Region,
Client_Segment
)
SELECT
C.C_CUSTKEY AS Client_ID,
C.C_NAME AS Client_Name,
'Unknown' AS Client_Type, -- Placeholder, adjust as needed
'Unknown' AS Client_Industry, -- Placeholder, adjust as needed
N.N_NAME AS Client_Region, -- Using Nation name as Client_Region
C.C_MKTSEGMENT AS Client_Segment
FROM `rightdata-1376`.tpch_1.Customer C
JOIN `rightdata-1376`.tpch_1.Nation N
ON C.C_NATIONKEY = N.N_NATIONKEY
[2024-09-23 15:40:04] DEBUG {metadata.Utils:sql_lineage:393} - Running lineage with query:
INSERT INTO `rightdata-1376.rddatasets.Client_Region_Data` (
Client_ID,
Client_Name,
Client_Address,
Client_Phone,
Client_Balance,
Client_Segment,
Nation_Name,
Region_Name,
Comment
)
SELECT
C.C_CUSTKEY AS Client_ID,
C.C_NAME AS Client_Name,
C.C_ADDRESS AS Client_Address,
C.C_PHONE AS Client_Phone,
C.C_ACCTBAL AS Client_Balance,
C.C_MKTSEGMENT AS Client_Segment,
N.N_NAME AS Nation_Name,
R.R_NAME AS Region_Name,
C.C_COMMENT AS Comment
FROM `rightdata-1376`.tpch_1.Customer C
JOIN `rightdata-1376`.tpch_1.Nation N
ON C.C_NATIONKEY = N.N_NATIONKEY
JOIN `rightdata-1376`.tpch_1.Region R
ON N.N_REGIONKEY = R.R_REGIONKEY
Version:
OS: Linux - Ubuntu 20.04
Python version: 3.8
OpenMetadata version: 1.2.5
Additional context
Add any other context about the problem here.
Affected module Ingestion Framework
Describe the bug running lineage ingestion from OM UI using airflow. Running the lineage using SDK. workflow = MetadataWorkflow.create(workflow_config_lineage)
Expected behavior airflow is able to get the kineage required. but SDK is not. I am not having any error msg while ingesting the data. Yet after successful completion of lineage ingestion I do not find any lineage information. Cannot see lineage while using SDK
DEBUG logs:
Version:
Additional context Add any other context about the problem here.