Open Tomccat3 opened 1 year ago
@danny0405 Hello Danny, can you help me with this problem?
@Tomccat3 Hudi should support it. Are you facing issues? What kind of schema changes you are having?
@Tomccat3 , schema evolution in Hudi is supported, but not automatically, you still need to alter the table schema manually and maybe restart the writing job.
Commenting for visibility as this might be a feature-request for implicit comprehensive schema evolution for Hudi-on-Flink.
@Tomccat3 , schema evolution in Hudi is supported, but not automatically, you still need to alter the table schema manually and maybe restart the writing job.
Thanks for your reply, i don't want to restart the writing job, can schema evolution be made automatic in flink datastream job?
@Tomccat3 Hudi should support it. Are you facing issues? What kind of schema changes you are having?
For example:
be made automatic in flink datastream job
No, the open source Flink does no support pass the schema around to the sink, that is a tricky part for Hudi to track the schema changes dynamically.
No, the open source Flink does no support pass the schema around to the sink,
ok, what if we implement a RowData with schema?
yeah, possible, you can pass around the schema together with the RowData, before each commit, you can overwrite the schema with the latest schema, only feasible for Flink DataStream API.
ok, i will try
ANy example o how to do this? How can you build the Hoodie Pipeline Builder dynamically from RowData?
@Tomccat3 Can you please provide an example of passing the schema along with the RowData? Currently, it seems to be tightly coupled with the HoodiePipelineBuilder with a static schema.
Tips before filing an issue
Have you gone through our FAQs?
Join the mailing list to engage in conversations and get faster support at dev-subscribe@hudi.apache.org.
If you have triaged this as a bug, then file an issue directly.
Describe the problem you faced
A clear and concise description of the problem.
I'm using flink ingest data from mysql to hudi,but I'm running into a problem: mysql table schema will change,i want implement automatic schema evolution, does hudi already support this requirement, or is it already in development
Steps to reproduce the behavior:
1. 2. 3. 4.
Expected behavior
A clear and concise description of what you expected to happen.
Environment Description
Hudi version :
Spark version :
Hive version :
Hadoop version :
Storage (HDFS/S3/GCS..) :
Running on Docker? (yes/no) :
Additional context
Add any other context about the problem here.
Stacktrace
Add the stacktrace of the error.