This library exposes the logic to query: HDFS, KUDU and OpenTSDB. In particular it allows to:
logical_uri
logical_uri
Given a Dataframe, it can
columns
where
conditionsgroupBy
on a grouping column applying multiple grouping conditions of type column -> aggregation function.The supported aggregation functions are: ["count", "max", "mean", "min", "sum"] All the implemented operations takes as input a Try[DataFrame] and a list of parameters
import org.apache.spark.sql.DataFrame
import scala.util.{Failure, Success, Try}
def implementedFunction(df: Try[DataFrame], params: Any*): Try[DataFrame] = ???
To check how to use this library please take a look at the tests.
... WIP
All the test can be executed by running:
sbt test