Open eldar-elne opened 4 months ago
Hi, I'm trying to tag my df using s3.parquet. What happens is when: output file size is < 100mb - they are getting tagged output file size is > 100mb - they are NOT getting tagged
s3.parquet
Infra: EMR 6.14.0 Spark 3.4.1
df \ .repartition(600) \ .write \ .partitionBy(YEAR_COLUMN, MONTH_COLUMN, DAY_COLUMN) \ .mode("overwrite") \ .format("s3.parquet") \ .option("tags", tags) \ .save(path)
LMK if any extra info is needed
Hi, I'm trying to tag my df using
s3.parquet
. What happens is when: output file size is < 100mb - they are getting tagged output file size is > 100mb - they are NOT getting taggedInfra: EMR 6.14.0 Spark 3.4.1
LMK if any extra info is needed