matrixorigin / matrixone

Hyperconverged cloud-edge native database
https://docs.matrixorigin.cn/en
Apache License 2.0
1.79k stars 277 forks source link

[Subtask]: loading tpch data too slow #20243

Open badboynt1 opened 20 hours ago

badboynt1 commented 20 hours ago

Parent Issue

20242

Detail of Subtask

loading tpch data is too slow, need to improve image image

pprof.mo-service.samples.cpu.006.pb.gz

Describe implementation you've considered

No response

Additional information

No response

ouyuanning commented 18 hours ago

波洋先处理一下 1、看一下多CN的负载分配是否均匀 2、并行数是否合理 3、再看看csv parser和external scan有哪些可以优化的吧