apache / datafusion

Apache DataFusion SQL Query Engine
https://datafusion.apache.org/
Apache License 2.0
5.53k stars 1.03k forks source link

Support different types of Hash tables in Grouping/Aggregation #6670

Open mingmwang opened 1 year ago

mingmwang commented 1 year ago

Is your feature request related to a problem or challenge?

No response

Describe the solution you'd like

No response

Describe alternatives you've considered

No response

Additional context

No response

alamb commented 1 year ago

What types of hash tables do you have in mind?

mingmwang commented 1 year ago

What types of hash tables do you have in mind?

I would like to do some POC with the partitioned hash table first. There are some other types of hash table which are optimized for different type of hash key.

Clickhouse hash tables: https://github.com/ClickHouse/ClickHouse/tree/master/src/Common/HashTable

DuckDB: https://github.com/duckdb/duckdb/blob/master/src/execution/partitionable_hashtable.cpp