Closed nehanene15 closed 2 years ago
Ibis generates a group_by query like so:
SELECT `bikeid`, count(*) AS `count` FROM `bigquery-public-data.new_york_citibike.citibike_trips` GROUP BY 1
which is unsupported in Hive by default. GROUP BY requires the column name instead of the integer position.
This will require overriding the format_group_by() function in third_party/ibis/ibis_impala/compiler.py so that we can reference the column name.
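To illustrate the idea, here is a minimal sketch of producing a GROUP BY clause from column names rather than positions. It is not the Ibis compiler code: `group_by_clause` and its arguments are hypothetical names used only for illustration, and it assumes every grouped item is a plain column from the SELECT list.

```python
def group_by_clause(select_columns, positions):
    """Build a GROUP BY clause that names columns instead of using positions.

    select_columns: column names in SELECT order (hypothetical input).
    positions: 1-based positions, as in the generated `GROUP BY 1`.
    """
    # Translate each 1-based position into the matching column name.
    names = [select_columns[p - 1] for p in positions]
    # Quote each name with backticks, matching the query shown above.
    return "GROUP BY " + ", ".join(f"`{name}`" for name in names)


clause = group_by_clause(["bikeid", "count"], [1])
print(clause)  # GROUP BY `bikeid`
```

A real override would also have to handle grouping by expressions, where the full expression (not just a name) has to be repeated in the clause.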
I was able to solve this issue by setting the following property in Hive: hive.groupby.orderby.position.alias=true
There are two approaches to set this property:
I followed approach 1 and it fixed the issue.
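For reference, one general way to apply a Hive property for a single session is to issue a SET statement before the query. The sketch below only builds the list of statements to run; it is an assumption for illustration, not necessarily either of the approaches mentioned above, and `with_position_alias` is a hypothetical helper name.

```python
# Property named in the comment above.
POSITION_ALIAS_PROPERTY = "hive.groupby.orderby.position.alias"


def with_position_alias(query):
    """Return the statements to run, in order, so that Hive accepts `GROUP BY 1`."""
    # The session-level SET must be executed before the query that relies on it.
    return [f"SET {POSITION_ALIAS_PROPERTY}=true", query]


statements = with_position_alias(
    "SELECT `bikeid`, count(*) AS `count` FROM trips GROUP BY 1"
)
for statement in statements:
    print(statement)
```

Each statement would then be executed in order on the same Hive connection, since a session-level setting does not carry over to a new session. The table name `trips` is a placeholder.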