apache / kyuubi

Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
https://kyuubi.apache.org/
Apache License 2.0
2.08k stars 908 forks source link

[Umbrella] Suppors LakeSoul integration #4355

Open SteNicholas opened 1 year ago

SteNicholas commented 1 year ago

Code of Conduct

Search before asking

Describe the proposal

LakeSoul implements incremental upserts for both row and column and allows concurrent updates. LakeSoul uses LSM-Tree like structure to support updates on hash partitioning table with primary key, and achieve very high write throughput (30MB/s/core) on cloud object store like S3 while providing optimized merge on read performance. LakeSoul scales metadata management and achieves ACID control by using PostgreSQL. LakeSoul provides tools to ingest CDC and log streams automatically in a zero-ETL style. Kyuubi could support the integration with LakeSoul.

Task list

Suppors LakeSoul integration.

Are you willing to submit PR?

Asakiny commented 1 year ago

Hi, @SteNicholas LakeSoul can be integrated in Kyuubi now, both support Spark and Flink SQL Query Engine.

Please refer to : https://lakesoul-io.github.io/docs/Usage%20Docs/setup-kyuubi,

Maybe we could add docs to Kyuubi.

SteNicholas commented 1 year ago

@Asakiny, could you add document of LakeSoul integration in Kyuubi docs?