-
This is a collection of items to improve external (spilling) aggregation
### Background
> Abstract—Analytical database systems offer high-performance in-memory aggregation. If there are many uniqu…
alamb updated
4 weeks ago
-
### Is your feature request related to a problem or challenge?
For measuring the performance improvement of #11827 , some extended queries with `more complex udaf(like median, approx_median)` + `h…
-
https://docs.pingcap.com/tidb-in-kubernetes/stable/get-started#step-2-deploy-tidb-operator
-
**Describe the bug**
`spiced` reports extreme dataset sizes after accelerating them to arrow:
```bash
2024-07-09T23:02:53.692352Z INFO runtime: Dataset lineitem registered (odbc:lineitem), acc…
-
Your documentation page [DataFrames at Scale Comparison: TPC-H](https://docs.coiled.io/blog/tpch.html?utm_source=dask-blog&utm_medium=dask-is-fast) has some good information on [how you setup the benc…
jgrg updated
4 months ago
-
on https://docs.databend.com/guides/benchmark/tpch there is no date on when the TPC-H benchmark was run
-
Similar #1498. I think that as the queries are currently written it isn't a fair comparison between DataFrame API's.
For SQL it is fair as the TPCH benchmark states that all engines should parse th…
-
### Is there an existing issue for the same bug?
- [X] I have checked the existing issues.
### Branch Name
2.0-dev
### Commit ID
3fced2dccf3763254979dd9f196e3896d97aa322
### Other Environment In…
-
For example, TPC-H Q3
```
┌─────────────────────────────────────────────────────────────────┬───────┬────────┬───────────┬───────────┬───────────┐
│ Operation …
-
I have not yet looked into the details, but I wonder: what is the difference of this project compared to
https://github.com/2ndQuadrant/pg-tpch
?
It seems that you are both working for the same c…