apache / iceberg

Apache Iceberg
https://iceberg.apache.org/
Apache License 2.0
6.38k stars 2.2k forks source link

trino iceberg planning time so long #2619

Closed maxingui closed 4 months ago

maxingui commented 3 years ago

we use trino iceberg with huawei OBS,the planning time is more than 20 s,while the total SCHEDULE and RUNNING time is only 2 s ,I do not what worry with it;

the debug log is like that: 2021-05-20T06:06:01.039Z DEBUG Query-20210520_060537_00323_gbd7b-4734 io.trino.plugin.hive.metastore.thrift.ThriftHiveMetastoreClient Invocation of get_table_req(req=GetTableRequest(dbName:icebergsf1_02, tblName:customer, capabilities:ClientCapabilities(values:[INSERT_ONLY_TABLES]))) succeeded in 8.69ms 2021-05-20T06:06:01.049Z INFO Query-20210520_060537_00323_gbd7b-4734 org.apache.iceberg.BaseTableScan Scanning table "icebergsf1_02".customer snapshot 5772262083695712847 created at 2021-05-18 07:43:19.363 with filter true 2021-05-20T06:06:01.127Z DEBUG Query-20210520_060537_00323_gbd7b-4734 io.trino.plugin.hive.metastore.thrift.ThriftHiveMetastoreClient Invocation of get_table_req(req=GetTableRequest(dbName:icebergsf1_02, tblName:customer, capabilities:ClientCapabilities(values:[INSERT_ONLY_TABLES]))) succeeded in 8.70ms 2021-05-20T06:06:01.136Z DEBUG Query-20210520_060537_00323_gbd7b-4734 io.trino.plugin.hive.metastore.thrift.ThriftHiveMetastoreClient Invocation of get_table_req(req=GetTableRequest(dbName:icebergsf1_02, tblName:customer, capabilities:ClientCapabilities(values:[INSERT_ONLY_TABLES]))) succeeded in 7.93ms 2021-05-20T06:06:01.158Z DEBUG Query-20210520_060537_00323_gbd7b-4734 io.trino.plugin.hive.metastore.thrift.ThriftHiveMetastoreClient Invocation of get_table_req(req=GetTableRequest(dbName:icebergsf1_02, tblName:customer, capabilities:ClientCapabilities(values:[INSERT_ONLY_TABLES]))) succeeded in 8.10ms 2021-05-20T06:06:01.171Z INFO Query-20210520_060537_00323_gbd7b-4734 org.apache.iceberg.BaseTableScan Scanning table "icebergsf1_02".customer snapshot 5772262083695712847 created at 2021-05-18 07:43:19.363 with filter true 2021-05-20T06:06:01.193Z DEBUG Query-20210520_060537_00323_gbd7b-4734 io.trino.sql.planner.DistributedExecutionPlanner Dynamic filters: [Descriptor{id=df_2014, input="sr_returned_date_sk_0", operator=EQUAL, nullAllowed=false}] 2021-05-20T06:06:01.203Z DEBUG Query-20210520_060537_00323_gbd7b-4734 io.trino.plugin.hive.metastore.thrift.ThriftHiveMetastoreClient Invocation of get_table_req(req=GetTableRequest(dbName:icebergsf1_02, tblName:store_returns, capabilities:ClientCapabilities(values:[INSERT_ONLY_TABLES]))) succeeded in 8.39ms 2021-05-20T06:06:01.214Z INFO Query-20210520_060537_00323_gbd7b-4734 org.apache.iceberg.BaseTableScan Scanning table "icebergsf1_02"."store_returns" snapshot 3075814051181790716 created at 2021-05-18 07:44:47.401 with filter true

dixingxing0 commented 3 years ago

It seems your log is incomplete, i think it's better to have more log.

Since you created this issue in iceberg project, i assume you are more inclined to think it is related to iceberg, do you have more clues about it?

Did you have tried with other engine like spark sql, if spark sql also planned slow, then maybe it is irrelevant to trino.

BTW, i've tried trino356 (with PR https://github.com/trinodb/trino/pull/7367), the analyze phase was much faster, like from 2s reduced to 1s in my test case.

maxingui commented 3 years ago

thinks for you reply,my trino version is 356 too; the issue does not appear in hdfs+hadoop environment,but trino iceberg with huawei OBS accurs log as below: 22.log

github-actions[bot] commented 6 months ago

This issue has been automatically marked as stale because it has been open for 180 days with no activity. It will be closed in next 14 days if no further activity occurs. To permanently prevent this issue from being considered stale, add the label 'not-stale', but commenting on the issue is preferred when possible.

github-actions[bot] commented 4 months ago

This issue has been closed because it has not received any activity in the last 14 days since being marked as 'stale'