I'm experiencing some incompatibility with Json SerDe and Partitioning, here's
an example query :
CREATE TABLE clicks (
condition_set STRING,
creative STRING,
date_created STRING,
from_app STRING,
from_campaign STRING,
meta_country STRING,
meta_model STRING,
meta_os STRING,
to_app STRING,
to_campaign STRING,
uuid STRING,
`time` STRING,
`hour` STRING
)
PARTITIONED BY (`date` STRING)
ROW FORMAT
SERDE 'com.amazon.elasticmapreduce.JsonSerde'
WITH SERDEPROPERTIES ('paths'='
condition_set,
creative,
date_created,
from_app,
from_campaign,
meta_country,
meta_model,
meta_os,
to_app,
to_campaign,
uuid,
time,
date,
hour')
LOCATION '/mnt/hdfsmall/'
;
Error is : Error in metadata: org.apache.hadoop.hive.ql.metadata.HiveException:
MetaException(message:org.apache.hadoop.hive.serde2.SerDeException Expected a
one-one correspondance between paths 14 and columns 13)
If I add the column in the table column's definition, the partitioning will
give me an error too.
I think I tried every possibility in Hive to go around the problem. I have no
clue anymore on how to solve this So I think it's just not possible.
Original issue reported on code.google.com by flav...@chartboost.com on 1 Dec 2011 at 7:01
Original issue reported on code.google.com by
flav...@chartboost.com
on 1 Dec 2011 at 7:01