Open bithw1 opened 17 hours ago
what environment did you run the spark-sql. There must be spark configs somewhere on where to sync.
If the table is registered in HMS or glue or some data catalog, it isn't an external table.
what environment did you run the spark-sql. There must be spark configs somewhere on where to sync.
If the table is registered in HMS or glue or some data catalog, it isn't an external table.
I am using Centos7,Spark 3.3,2, I don't make extra configuration for spark and spark sql, except copying hive-site.xml under the spark conf dir
It has to register the tables somewhere. It may be the spark bundle you're using. If your use case is to how things are supposed to work, you can check out the new hudi docker demo that is being built. https://github.com/alberttwong/onehouse-demos/tree/main/hudi-spark-minio-trino
I am trying Hudi 0.15.0 and spark 3.3.0.
I have put the hive-site.xml under my $SPARK_HOME/conf, and I startup the spark sql with following command:
Then, I create a table with following DDL from spark-sql cli:
The table is successfully created,but I got two questions here.
hoodie.datasource.meta.sync.enable
orhoodie.datasource.hive_sync.mode
or sth else, I would ask how this could happen.