Closed Kangho-Lee closed 9 months ago
@Kangho-Lee This is happening as in the new schema, _hoodie_is_deleted
is non-nullable column which can't be evolved as existing data dont have this column and need to set as null.
For these cases, we introduced a new prop starting 0.14.0 in this PR - https://github.com/apache/hudi/pull/9262 so add this configuration 'hoodie.datasource.write.new.columns.nullable' as true.
@ad1happy2go thanks to reply.
I tried to add hoodie.datasource.write.new.columns.nullable
configuration.
For the new hoodie.streamer.source.dfs.root
data, no merge error occurred and the desired result was obtained.
But, Is there any way to change the value of an existing (parquet) data in an existing table? Existing data is not applied by transformer sql and the value is replaced with null.
hoodie.streamer.transformer.sql=SELECT *, if(__deleted = "false" or __deleted is null, cast(false as boolean), cast(true as boolean)) end AS _hoodie_is_deleted FROM <SRC> a
@Kangho-Lee So you want to update the old data also, The only way is to re-ingest that old data so it follow the upsert path again with _hoodie_is_deleted
and do the needful. Thanks.
@ad1happy2go It looks like doing a re-ingest or creating a new table same. Thank you for answer.
@Kangho-Lee Thanks for the answer. Closing the issue then. Please create a new one for any further issues/queries.
Hello guys. this post is from january 2020, any updates about deletion with deltastreamer? Is there a way to avoid having to add this field to incoming records for existing hudi table? I want hard delete an existing table rather than creating a new table with
_hoodie_is_deleted
field.When I tried to change the schema by adding the _hoodie_is_deleted field to the existing table by using transformer sql like
an error occurred when merging the parquet files in the existing partition directory like
I use hudi version 0.14.0 and conf is below (Delete personal/company information)
spark-submit is below
Using transformer sql above with creating a new table is works what I want. However, when using the above method on an existing hudi table, a merge error occurs. Please help me to solve.