wjohnson / pyapacheatlas

A Python package to help work with the Apache Atlas REST APIs
https://wjohnson.github.io/pyapacheatlas-docs/latest/
MIT License

Process info #263

Closed: Vamshi7337 closed this 9 months ago

Vamshi7337 commented 11 months ago

https://github.com/wjohnson/pyapacheatlas/blob/f5fa20c8042b019b670d32542c068f558ed0d9e9/samples/CRUD/create_entity_and_lineage.py#L57

What if there is no process between two entities? In that case, what typeName, qualifiedName, and name should be given? @wjohnson

wjohnson commented 9 months ago

Hi, @Vamshi7337! Thank you for the question. The normal way to do this in Apache Atlas / Microsoft Purview is to include a Process asset between the two objects. In Apache Atlas, you'd normally create a dummy process whose only job is to sit between the two.
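As a rough illustration of the dummy process idea, here is a minimal sketch that builds the entity JSON by hand with plain dictionaries, in the shape the Atlas v2 bulk entity endpoint accepts. It assumes the built-in `Process` type is acceptable as the typeName (a custom process subtype would work the same way), and that name and qualifiedName are free-form values you choose, with qualifiedName being unique. The `pyapacheatlas://...` qualified names, the `DataSet` type, and the helper functions `entity_ref` and `dummy_process` are all hypothetical and exist only for this example.

```python
import json


def entity_ref(type_name, qualified_name):
    """Point at an existing entity by its type and qualifiedName (hypothetical helper)."""
    return {
        "typeName": type_name,
        "uniqueAttributes": {"qualifiedName": qualified_name},
    }


def dummy_process(name, qualified_name, inputs, outputs, guid="-1"):
    """Build a placeholder Process entity that links inputs to outputs.

    A negative guid marks the entity as new, so Atlas assigns a real guid on upload.
    """
    return {
        "typeName": "Process",  # assumption: the built-in base Process type
        "guid": guid,
        "attributes": {
            "name": name,  # display label, any readable text
            "qualifiedName": qualified_name,  # must be unique, otherwise free-form
            "inputs": inputs,
            "outputs": outputs,
        },
    }


# Hypothetical source and sink entities that already exist in the catalog.
source = entity_ref("DataSet", "pyapacheatlas://demo_source")
sink = entity_ref("DataSet", "pyapacheatlas://demo_sink")

process = dummy_process(
    name="demo_source to demo_sink",
    qualified_name="pyapacheatlas://demo_source_to_demo_sink",
    inputs=[source],
    outputs=[sink],
)

# Body for a bulk entity upload; sending it is left out of this sketch.
payload = {"entities": [process]}
print(json.dumps(payload, indent=2))
```

The process carries no real metadata of its own; it exists only so the lineage graph has something connecting the input to the output.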

Microsoft Purview introduced "Manual Lineage", which added a Sources and Sinks relationship attribute and a trick in the Microsoft Purview Lineage UI (and, I believe, the /lineage API) that renders the lineage graph so that a Source appears as an Input to the Sink.

See this sample I provided on Purview Lineage with No Process, but be warned: this is technically undocumented and may change in unexpected ways.
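For a sense of what the no-process approach might look like as a payload, here is a heavily hedged sketch. It assumes the relationship attribute on the sink is keyed `sources` and takes a list of entity references; since the feature is undocumented, that key name and shape are assumptions and may not match what Purview actually expects. The helper `sink_with_sources`, the `DataSet` type, and the qualified names are hypothetical.

```python
def sink_with_sources(type_name, name, qualified_name, source_refs, guid="-1"):
    """Build a sink entity that names its sources directly, with no Process between.

    ASSUMPTION: the relationship attribute key is "sources". This is undocumented
    behavior and may differ or change.
    """
    return {
        "typeName": type_name,
        "guid": guid,
        "attributes": {"name": name, "qualifiedName": qualified_name},
        "relationshipAttributes": {"sources": source_refs},
    }


# Hypothetical reference to an existing source entity.
source_ref = {
    "typeName": "DataSet",
    "uniqueAttributes": {"qualifiedName": "pyapacheatlas://demo_source"},
}

sink_entity = sink_with_sources(
    type_name="DataSet",
    name="demo_sink",
    qualified_name="pyapacheatlas://demo_sink",
    source_refs=[source_ref],
)
```

Compared with a dummy process, this keeps the graph to two nodes, at the cost of relying on behavior that is not part of the documented API.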