Closed xiangyuf closed 2 weeks ago
@YannByron @Aitozi Hi, would you kindly review this?
Do we need to support tables with the same name in different databases/catalogs, as Flink's global options do? https://github.com/apache/paimon/pull/2104
I think we should find a unified approach for Flink and Spark.
@Aitozi @JingsongLi Thanks for the replies. +1 for unifying this.
@JingsongLi @Aitozi
Hi, I've unified Flink and Spark to support both dynamic table options and global options.
Global options format:
Flink: {config_key}
Spark: spark.paimon.{config_key}
Table options format:
Flink: paimon.${catalogName}.${dbName}.${tableName}.${config_key}
Spark: spark.paimon.${dbName}.${tableName}.{config_key}
Dynamic table options will override global options if there are conflicts.
WDYT?
@Aitozi I’ve updated the dynamic global options format for Flink to {config_key} instead of paimon.{config_key}.
Got it, LGTM
Why does the Flink format contain ${catalogName}, but the Spark format does not?
@Zouxxyy Updated Spark table option format as:
spark.paimon.${catalogName}.${dbName}.${tableName}.${config_key}
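A minimal sketch of how the precedence discussed in this thread could be resolved for a single table, assuming the Spark key formats above (spark.paimon.{config_key} for global options, spark.paimon.${catalogName}.${dbName}.${tableName}.${config_key} for table options). The helper name resolve_table_options and the names my_catalog, my_db, t1 and t2 are hypothetical, and this is not Paimon's actual implementation:

```python
from typing import Dict

GLOBAL_PREFIX = "spark.paimon."


def resolve_table_options(
    conf: Dict[str, str], catalog: str, db: str, table: str
) -> Dict[str, str]:
    """Merge global and per-table options for one table (hypothetical helper)."""
    table_prefix = f"{GLOBAL_PREFIX}{catalog}.{db}.{table}."
    global_opts: Dict[str, str] = {}
    table_opts: Dict[str, str] = {}
    for key, value in conf.items():
        if key.startswith(table_prefix):
            # Key is scoped to this table: strip the table prefix.
            table_opts[key[len(table_prefix):]] = value
        elif key.startswith(GLOBAL_PREFIX):
            # Simplification: keys scoped to *other* tables also land here
            # under their full suffix; a real implementation would filter them.
            global_opts[key[len(GLOBAL_PREFIX):]] = value
    # Table options are applied last, so they override global options.
    return {**global_opts, **table_opts}


conf = {
    "spark.paimon.scan.tag-name": "tag_1",
    "spark.paimon.my_catalog.my_db.t1.scan.tag-name": "tag_3",
}
opts_t1 = resolve_table_options(conf, "my_catalog", "my_db", "t1")
opts_t2 = resolve_table_options(conf, "my_catalog", "my_db", "t2")
```

Here t1 would read with its own tag, while t2 falls back to the global setting.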
@Zouxxyy @JingsongLi CI has passed, please take a look.
Purpose
Linked issue: close #4371
In some cases, users may want to use Spark time travel by setting a property, for example set spark.paimon.scan.tag-name=tag_3. However, this property takes effect globally if the Spark job reads multiple tables at the same time. It would be better to support table options via SQL conf for the Spark engine, so that users can specify different time travel options for different tables.
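A sketch of what such per-table settings could look like, assuming the table option format settled on in the review discussion (spark.paimon.${catalogName}.${dbName}.${tableName}.${config_key}). The helper table_option_statement and the names my_catalog, my_db, t1, t2 and tag_5 are hypothetical, chosen only for illustration:

```python
def table_option_statement(
    catalog: str, db: str, table: str, key: str, value: str
) -> str:
    """Build a Spark SQL SET statement that scopes a Paimon option to one table."""
    return f"SET spark.paimon.{catalog}.{db}.{table}.{key}={value}"


# Two tables in the same job, each pinned to a different tag.
statements = [
    table_option_statement("my_catalog", "my_db", "t1", "scan.tag-name", "tag_3"),
    table_option_statement("my_catalog", "my_db", "t2", "scan.tag-name", "tag_5"),
]
for stmt in statements:
    print(stmt)
```

Each generated statement could then be run in a Spark SQL session before querying the two tables, so that each read uses its own tag.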
Tests
API and Format
Documentation