Sub-partitioning by symbols

QuestDB partitions time-series data by time. However, with large amounts of data combined with many symbols (think, time series), queries such as SAMPLE BY or LATEST BY for a specific symbol can sometimes be slow. Adding an additional partitioning strategy for single or multiple symbol columns would boost query performance significantly. The table with a given symbol would then be virtualized and accessed a lot faster than otherwise.

This feature would support WAL tables only.

As an example, users could partition the table by times, such as hour as well as product_id column. Then if they run a SAMPLE BY query for a single product_id, only a single sub-partition would be accessed by the database yielding a significant query speed improvement over index-based access on a non-sub-partitioned table.

In the next version of this feature, we're going to add support for sub-partitioning by geo-hash column(s) in order to solve this issue: https://github.com/questdb/questdb/issues/2967

questdb / roadmap

Sub-partitioning by symbols #45