microsoft / lst-bench

LST-Bench is a framework that allows users to run benchmarks specifically designed for evaluating Log-Structured Tables (LSTs) such as Delta Lake, Apache Hudi, and Apache Iceberg.
Apache License 2.0

Add Apache Flink engine support #259

Open ikyranas opened 3 months ago

ikyranas commented 3 months ago

Motivation

Apache Flink is a popular framework that offers querying capabilities and read and write operations on Apache Iceberg, Apache Hudi, and Delta Lake. Whereas Spark is used primarily for batch processing, Flink can also be used for streaming analytics applications.

Purpose

Would the team be interested in adding Apache Flink to the engines supported by LST-Bench?

Suggestion for PR

A PR that integrates Flink in accordance with the current repo's structure would need changes in the following:

If this aligns with the team's plans, I could create the PR.

jcamachor commented 2 months ago

@ikyranas, thanks for opening this issue. This would be fantastic; we would love to have support for other popular OSS engines in LST-Bench. It would also be very valuable to add new workloads that are representative of Flink and not currently covered by our implementation. Please let us know if you hit any blockers. Looking forward to your contribution!