Closed HariprasadAllaka1612 closed 4 years ago
Looks like a schema mismatch. Did you change a number to a string, for example?
cc @lamber-ken @leesf, are either of you interested in helping here? :)
We can close this issue. The problem was that the Parquet files and the Hive table synced to them had two different schemas. It is fixed by forcing the Parquet schema to always match the schema in the Hive metastore.
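One way to check for this kind of mismatch is to compare the column types of the existing table with those of the incoming batch. The sketch below is only an illustration in plain Python: the function name `diff_schemas` and the sample columns are hypothetical and do not come from this issue. With PySpark, a mapping like these could probably be built with `dict(df.dtypes)`.

```python
def diff_schemas(table_schema, incoming_schema):
    """Return {column: (table_type, incoming_type)} for columns that differ.

    A type of None means the column is missing on that side.
    """
    mismatches = {}
    for column in sorted(set(table_schema) | set(incoming_schema)):
        old_type = table_schema.get(column)
        new_type = incoming_schema.get(column)
        if old_type != new_type:
            mismatches[column] = (old_type, new_type)
    return mismatches


# Hypothetical schemas: "amount" was written as a number before,
# but the new batch carries it as a string.
table_schema = {"id": "bigint", "amount": "double", "ts": "string"}
incoming_schema = {"id": "bigint", "amount": "string", "ts": "string"}

mismatches = diff_schemas(table_schema, incoming_schema)
for column, (old_type, new_type) in mismatches.items():
    print(f"{column}: table has {old_type}, incoming batch has {new_type}")
```

Running a check like this before each upsert would surface a changed column type early instead of at merge time.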
Thank you.
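The thread does not show the code behind this fix, so the following is only a sketch of one possible approach, not the reporter's actual solution. It assumes the Hive metastore schema is available as a list of `(column, type)` pairs and builds Spark SQL cast expressions that could be passed to PySpark's `DataFrame.selectExpr` before writing to Hudi. The helper name `cast_expressions`, the sample schema, and `incoming_df` are hypothetical.

```python
def cast_expressions(target_schema):
    """Build one Spark SQL cast expression per column of the target schema.

    target_schema is a list of (column_name, spark_sql_type) pairs,
    in the column order the table expects.
    """
    return [
        f"CAST(`{name}` AS {dtype}) AS `{name}`"
        for name, dtype in target_schema
    ]


# Hypothetical schema as recorded in the Hive metastore.
hive_schema = [("id", "bigint"), ("amount", "double")]
exprs = cast_expressions(hive_schema)
print(exprs)

# With a PySpark DataFrame (assumed name: incoming_df), the batch could
# then be aligned to the metastore schema before the Hudi write:
# aligned_df = incoming_df.selectExpr(*exprs)
```

Selecting only the metastore columns, in a fixed order and with fixed types, keeps every write on the same Parquet schema. Values that cannot be cast to the target type may end up as nulls, so it is worth validating the data as well.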
Parquet schema changing for various writes to Hudi.
With continuous writes to S3 in Hudi format, there are instances where the schema of the Parquet file changes, and when writing/upserting to the same partition we get a merge error. I am using the COW (copy-on-write) storage format.
To Reproduce
Steps to reproduce the behavior:
Expected behavior
Environment Description
Hudi version : 0.5.1
Spark version : 2.4.0
Hive version : 2.3.4
Hadoop version : 2.8.5
Storage (HDFS/S3/GCS..) : S3
Running on Docker? (yes/no) : No