Open FishMAN002 opened 1 year ago
@FishMAN002 Which hudi-spark bundle jar are you using? Did you build it yourself? Can you try with --packages org.apache.hudi:hudi-spark3.1-bundle_2.12:0.13.1?
@ad1happy2go I built it myself. My build command is:
/usr/local/opt/apache-maven-3.8.5/bin/mvn clean package -DskipTests -Dspark3.1 -Dflink1.14 -Dscala-2.12 -Drat.skip=true -Dcheckstyle.skip=true
The resulting hudi-spark bundle jar is named hudi-spark3.1-bundle_2.12-0.13.1.jar.
@FishMAN002 The build command looks good. Did you try with --packages org.apache.hudi:hudi-spark3.1-bundle_2.12:0.13.1? If you still get the same issue, that will at least rule out a jar compatibility problem.
I tried exactly the same command with the --packages option and the CTAS worked fine for me.
@ad1happy2go Are you suggesting that I try this command:
/usr/local/opt/apache-maven-3.8.5/bin/mvn clean package -DskipTests -Dspark3.1 -Dflink1.14 -Dscala-2.12 -Drat.skip=true -Dcheckstyle.skip=true **--packages org.apache.hudi:hudi-spark3.1-bundle_2.12:0.13.1**
If that's not right, could you provide me with the right command? Thank you very much!
@FishMAN002 Sorry for the delayed reply. What I meant is to use the Hudi Maven package I mentioned directly when you launch spark-shell/spark-sql/pyspark, not to pass it to the mvn build. I hope you have figured that out by now. Are you still facing this issue? Please let us know.
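For reference, a minimal sketch of what launching spark-sql with the published bundle could look like. The --packages coordinates are the ones given in this thread; the three --conf settings are an assumption based on the Hudi quick start for the 0.13.x line and were not stated here, so adjust them to your environment. The variable names are only illustrative.

```shell
# Maven coordinates from this thread; Spark resolves and downloads the bundle at launch.
HUDI_BUNDLE="org.apache.hudi:hudi-spark3.1-bundle_2.12:0.13.1"

# Assemble the launch command. The --conf values are assumptions taken from
# the Hudi quick start, not from this thread.
SPARK_SQL_CMD="spark-sql --packages ${HUDI_BUNDLE} \
  --conf spark.serializer=org.apache.spark.serializer.KryoSerializer \
  --conf spark.sql.extensions=org.apache.spark.sql.hudi.HoodieSparkSessionExtension \
  --conf spark.kryo.registrator=org.apache.spark.HoodieSparkKryoRegistrar"

# Print the command for review; to launch it, run: eval "${SPARK_SQL_CMD}"
echo "${SPARK_SQL_CMD}"
```

With --packages, the locally built jar should not also be on the classpath (for example in $SPARK_HOME/jars or via --jars), otherwise the test does not isolate the self-built bundle.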
Describe the problem you faced
CTAS (CREATE TABLE AS SELECT) fails with Spark 3.1.1 and Hudi 0.13.1.
To Reproduce
Steps to reproduce the behavior:
spark-sql shell
ctas shell
error
Environment Description
Hudi version : 0.13.1
Spark version : 3.1.1
Hive version : 3.1.2
Hadoop version : 3.2.1
Storage (HDFS/S3/GCS..) : s3
Running on Docker? (yes/no) : no
Related: [HUDI-3131] fix ctas error in spark3.1.1 #4549. @YannByron, can you help me check it?