oap-project / gazelle_plugin

Native SQL Engine plugin for Spark SQL with vectorized SIMD optimizations.
Apache License 2.0
256 stars 77 forks

Unexpected heap memory and off-heap memory allocation #1208

Open ziyangRen opened 1 year ago

ziyangRen commented 1 year ago

Describe the bug

Spark tasks that run with Gazelle are expected to use more off-heap memory. However, when we ran the 2 TB TPC-DS benchmark, we found that for most applications the on-heap memory usage was much greater than the off-heap memory usage, and some SQL statements did not use off-heap memory at all. This makes it difficult for us to configure the memory parameters.

Notes: We traversed all SparkListenerTaskEnd events in the event log and took the maximum values of JVMHeapMemory and OffHeapExecutionMemory across all Task Executor Metrics. These maxima are the basis for determining the on-heap and off-heap memory usage. The test results are as follows (memory unit: MB):

[image] [image]
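For anyone who wants to reproduce the measurement, here is a minimal sketch of the procedure described in the notes. It is not the reporter's actual script. It assumes the event log is an uncompressed file with one JSON event per line, that each SparkListenerTaskEnd event carries a "Task Executor Metrics" object, and that the metric values are in bytes. The function name `peak_task_memory_mb` is hypothetical.

```python
import json
from typing import Dict, Iterable

# Metric names taken from the issue text.
METRICS = ("JVMHeapMemory", "OffHeapExecutionMemory")


def peak_task_memory_mb(lines: Iterable[str]) -> Dict[str, float]:
    """Return the maximum of each metric over all SparkListenerTaskEnd events.

    Assumption: values in the log are bytes, so they are converted to MB
    (1 MB = 1024 * 1024 bytes) to match the unit used in the report.
    """
    peaks = {name: 0 for name in METRICS}
    for line in lines:
        line = line.strip()
        if not line:
            continue
        try:
            event = json.loads(line)
        except json.JSONDecodeError:
            continue  # skip lines that are not valid JSON
        if not isinstance(event, dict):
            continue
        if event.get("Event") != "SparkListenerTaskEnd":
            continue
        # Events without executor metrics contribute nothing to the maxima.
        metrics = event.get("Task Executor Metrics") or {}
        for name in METRICS:
            peaks[name] = max(peaks[name], metrics.get(name, 0))
    return {name: value / (1024 * 1024) for name, value in peaks.items()}
```

Usage, with a placeholder path: `with open("path/to/eventlog") as f: print(peak_task_memory_mb(f))`. A result of 0 for OffHeapExecutionMemory means that no task-end event in that log reported a nonzero value for that metric.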