nexr / RHive

RHive is an R extension facilitating distributed computing via Apache Hive.
http://nexr.github.io/RHive
122 stars 63 forks source link

rhive.connect() error: java.sql.SQLException: Execution Error, return code 1 from org.apache.hadoop.hive.ql.exec.FunctionTask #68

Open montxo5 opened 10 years ago

montxo5 commented 10 years ago

Hi,

I'm trying to use RHive and connect but it throws this error: Error: java.sql.SQLException: Error while processing statement: FAILED: Execution Error, return code 1 from org.apache.hadoop.hive.ql.exec.FunctionTask

My code is:

library(RHive) Sys.setenv("HIVE_HOME" = "/opt/cloudera/parcels/CDH-4.6.0-1.cdh4.6.0.p0.26/lib/hive/") Sys.setenv("HADOOP_HOME" = "/opt/cloudera/parcels/CDH-4.6.0-1.cdh4.6.0.p0.26/lib/hadoop/") Sys.setenv("HADOOP_CONF_DIR" = "/opt/cloudera/parcels/CDH-4.6.0-1.cdh4.6.0.p0.26/etc/hadoop/") Sys.setenv("RHIVE_FS_HOME" ="/rhive")

rhive.init(hiveLib = "../../rstudio/R/libs/hive",hadoopConf = "/opt/cloudera/parcels/CDH-4.6.0-1.cdh4.6.0.p0.26/etc/hadoop/") rhive.env() rhive.connect(host = "xx.xx.xx.xx",port = "10000",user = "xxxx", hiveServer2 = TRUE)

Is there any way to know the exact error? Thanks.

samuelzhou commented 10 years ago

Maybe RHive is trying to "create function" for UDFs. You need to check the log of HiveServer2 to find which statement is failed to execute.

ghost commented 9 years ago

We've had a similar issue that arose due to R not having sufficient permissions in hdfs.