Open algorithmy1 opened 1 month ago
Thanks for the suggestion, @algorithmy1! We couldn't agree more on the benefits you outlined. The good news is that we've been prototyping such an endpoint for OpenLineage batch events, see v2.LineageResource.collectBatchOf(BatchOfEvents)
. The endpoint will be available in Marquez 0.51.0
.
Currently, the Marquez API for OpenLineage events (
/api/v1/lineage
) accepts one event per request, as seen in OpenLineageResource.java#L67. While this is suitable for real-time ingestion, it becomes inefficient when we need to ingest multiple events simultaneously.Use Case:
Proposal:
/api/v1/lineage/batch
) that accepts an array of OpenLineage events.OpenLineageResource
class to handle a list of events in a single request.(Or even update the current one
/api/v1/lineage
to accept both options)Benefits: