Closed tomercagan closed 5 years ago
@tomercagan sorry for the delayed response - just wanted to let you know what I'm doing a bit of research to better understand your issue.
It looks like Hive is attempting to vectorize the binary output of st_point
? Can you try setting hive.vectorized.execution.enabled
to false
and see if you get the same error?
The sample in GIS-Tools-for-Hadoop in fact has a constructor in the where clause. Which goes to say, ST-Geometry constructor in a where clause should be fine, and has been done plenty of times.
@climbage - you nailed it - I was about to update - the issue indeed has to do with vectorization and setting hive.vectorized.execution.enabled
to false
does the trick!
@randallwhitman - you are correct but I was not able to get it to work so I was wondering about it...
Thanks for your time and comments!
Hi,
First off, thanks for providing this framework - it is very comprehensive, well document and easy to get started with!
Version information: Hive 1.2.1000.2.6.2.0-205 Hadoop 2.7.3.2.6.2.0-205 (I believe it's install as part of HDP distributon hdp/2.6.2.0-205/) I clone the sample and using the jars from the lib folder so I assume it is latest.
I am trying to do spatial aggregation where I need to get the number of points within pre-defined areas.
For the sake of this discussion, lets assume I have an areas table:
To which I load geo-JSON data using the serdes provided.
Next, I have a points table:
This table is created as a results of a pyspark job with some custom algorithm.
What I am trying to do is aggregate to get the number of points per area*:
`* This queries is a simplified version. In my full query, I am using binning in the join to narrow the search space. It works nicely, giving me a two fold improvement
This fails**. I've tried many variations but if all give me some variation on the log below. Generally, I get variations on
sometimes with the following as well:
On the other hand, when I create the point in the table definition, i.e. I make another step / table which includes a point:
The following query works fine
Am I doing something wrong in the first approach or is it not OK to call the constructor functions inline?
error.txt