kite-sdk / kite

Kite SDK
http://kitesdk.org/docs/current/
Apache License 2.0
394 stars 263 forks source link

csv-import from hdfs fails #450

Closed ghost closed 8 years ago

ghost commented 8 years ago

I am trying to run kite-dataset csv-import hdfs:/user/Florian/ratings.csv ratings

but it fails with the following error message:

org.kitesdk.data.DatasetNotFoundException: Unknown dataset URI pattern: dataset:hive://{fullyqualifiedname}:9083/default/ratings
Check that JARs for hive datasets are on the classpath

I have created a schema and a table by running:

kite-dataset csv-schema ratings.csv -o rating.avsc
kite-dataset create ratings --schema rating.avsc --partition-by year-month.json --format parquet

I use Kite version "1.0.0-cdh5.8.0"

ghost commented 8 years ago

I tried it on a different cluster and it works. Looks like it was a cluster configuration problem.