Closed baibaichen closed 19 hours ago
Introducing DeltaStats to collect stats as delta does.
DeltaStats
(Fixes: #7028)
Using Existed Uts
In test("test parquet table write with the delta"), adding logic to verify delta stats
test("test parquet table write with the delta")
if (spark35) { val vanillaTable = "lineitem_delta_parquet_vanilla" withSQLConf((GlutenConfig.NATIVE_WRITER_ENABLED.key, "false")) { doInsert(drop(vanillaTable), createLineitem(vanillaTable), insert(vanillaTable)) } val expected = DeltaStatsUtils .statsDF( spark, s"$basePath/$vanillaTable/_delta_log/00000000000000000001.json", q1SchemaString) .collect() checkAnswer( DeltaStatsUtils.statsDF( spark, s"$basePath/$table/_delta_log/00000000000000000001.json", q1SchemaString), expected ) }
https://github.com/apache/incubator-gluten/issues/7028
Run Gluten Clickhouse CI on x86
What changes were proposed in this pull request?
Introducing
DeltaStats
to collect stats as delta does.(Fixes: #7028)
How was this patch tested?
Using Existed Uts
In
test("test parquet table write with the delta")
, adding logic to verify delta stats