RImpala is an R package that helps you to connect and execute distributed queries using Cloudera Impala. Impala supports jdbc integration and this feature is used by RImpala to establish a connection between R and Impala.
To use this package you must also have access to a Hadoop cluster running Cloudera Impala with at least one populated table defined in the Hive Metastore.
rimpala.init()
RImpala-0.1.6.tar.gz
present inside install
directory
tar -xvf install/RImpala_0.1.6.tar.gz
R CMD INSTALL ./RImpala
library("RImpala")
rimpala.init(libs="/path/to/JDBC/jars/")
result = rimpala.query("your query");
by default rimpala.init() searches "/usr/lib/impala" for the JDBC jars.Here are links to more information on Cloudera Impala: