apache / datafusion

Apache DataFusion SQL Query Engine
https://datafusion.apache.org/
Apache License 2.0
6.33k stars 1.2k forks source link

Improve test coverage for spilling (memory-limited) sort/aggregation/sort-merge-join #13431

Open 2010YOUY01 opened 1 week ago

2010YOUY01 commented 1 week ago

Is your feature request related to a problem or challenge?

Now DataFusion supports spilling for memory-limited sort/aggregation/sort-merge-join operators. In order to stabilize those features https://github.com/apache/datafusion/issues/13123, more tests should be added.

The high-level objectives are:

  1. Memory limit stress tests for related operators: Run queries with large memory consumption (sorting a large relational table for example), and use minimal possible memory limit to let them run successfully
  2. Add integration test to make sure physical memory usage respects to configured memory limits
  3. Add fuzz tests to ensure the correctness of memory-limited queries (spilled and in-memory execution should return equivalent results)

Describe the solution you'd like

No response

Describe alternatives you've considered

No response

Additional context

No response