apache / doris

Apache Doris is an easy-to-use, high performance and unified analytics database.
https://doris.apache.org
Apache License 2.0

[Bug] #10612

Open foobarrer opened 2 years ago

foobarrer commented 2 years ago

Search before asking

Version

FE: 1.0.0-rc03-Unknown BE: 1.0.0-rc03-Unknown

What's Wrong?

When I run q2.sql (*/tools/tpch-tools/queries/q2.sql in the Doris source code), all three of my BEs die. I reproduced the problem and tailed the log of one of the three BEs; the log is below.

I0706 14:08:06.791376  8640 data_dir.cpp:739] path: /data8/fjyulu/apache-doris-1.0.0-incubating-bin/be/storage total capacity: 9921845387264, available capacity: 3276341948416
I0706 14:08:06.791924  8640 storage_engine.cpp:374] get root path info cost: 0 ms. tablet counter: 461
I0706 14:08:06.792373  8640 task_worker_pool.cpp:1621] finish report DISK. master host: 192.168.134.32, port: 39020
I0706 14:08:07.798548  8638 task_worker_pool.cpp:1621] finish report TASK. master host: 192.168.134.32, port: 39020
I0706 14:08:09.796348  8641 tablet_manager.cpp:882] begin to build all report tablets info
I0706 14:08:09.796386  8641 tablet_manager.cpp:887] find expired transactions for 0 tablets
I0706 14:08:09.797224  8641 tablet_manager.cpp:924] success to build all report tablets info. tablet_count=461
I0706 14:08:09.798719  8641 task_worker_pool.cpp:1621] finish report TABLET. master host: 192.168.134.32, port: 39020
I0706 14:08:22.799257  8638 task_worker_pool.cpp:1621] finish report TASK. master host: 192.168.134.32, port: 39020
I0706 14:08:34.799919  8638 task_worker_pool.cpp:1621] finish report TASK. master host: 192.168.134.32, port: 39020
I0706 14:08:35.982132  8697 plan_fragment_executor.cpp:75] PlanFragmentExecutor::prepare|pthread_id=139637499447040|backend_num=1|instance_id=b2e891a0079a4fdb-a664820c2e257482|query_id=b2e891a0079a4fdb-a664820c2e257481
I0706 14:08:35.994038  8530 fragment_mgr.cpp:459] PlanFragmentExecutor::_exec_actual|pthread_id=139638859065088|instance_id=b2e891a0079a4fdb-a664820c2e257482|query_id=b2e891a0079a4fdb-a664820c2e257481
I0706 14:08:35.994088  8530 plan_fragment_executor.cpp:233] PlanFragmentExecutor::open, using query memory limit: 8.00 GB|mem_limit=8589934592|instance_id=b2e891a0079a4fdb-a664820c2e257482|query_id=b2e891a0079a4fdb-a664820c2e257481
I0706 14:08:36.003167  8700 plan_fragment_executor.cpp:75] PlanFragmentExecutor::prepare|pthread_id=139637474268928|backend_num=6|instance_id=b2e891a0079a4fdb-a664820c2e257490|query_id=b2e891a0079a4fdb-a664820c2e257481
I0706 14:08:36.004102  8700 vdata_stream_sender.h:176] will use local Exchange, dest_node_id is : 24
I0706 14:08:36.004206  8487 fragment_mgr.cpp:459] PlanFragmentExecutor::_exec_actual|pthread_id=139639211558656|instance_id=b2e891a0079a4fdb-a664820c2e257490|query_id=b2e891a0079a4fdb-a664820c2e257481
I0706 14:08:36.004226  8487 plan_fragment_executor.cpp:233] PlanFragmentExecutor::open, using query memory limit: 8.00 GB|mem_limit=8589934592|instance_id=b2e891a0079a4fdb-a664820c2e257490|query_id=b2e891a0079a4fdb-a664820c2e257481
I0706 14:08:36.006261  8675 plan_fragment_executor.cpp:75] PlanFragmentExecutor::prepare|pthread_id=139637684086528|backend_num=9|instance_id=b2e891a0079a4fdb-a664820c2e25748d|query_id=b2e891a0079a4fdb-a664820c2e257481
I0706 14:08:36.007759  8675 vdata_stream_sender.h:176] will use local Exchange, dest_node_id is : 23
I0706 14:08:36.007896  8501 fragment_mgr.cpp:459] PlanFragmentExecutor::_exec_actual|pthread_id=139639094060800|instance_id=b2e891a0079a4fdb-a664820c2e25748d|query_id=b2e891a0079a4fdb-a664820c2e257481
I0706 14:08:36.007915  8501 plan_fragment_executor.cpp:233] PlanFragmentExecutor::open, using query memory limit: 8.00 GB|mem_limit=8589934592|instance_id=b2e891a0079a4fdb-a664820c2e25748d|query_id=b2e891a0079a4fdb-a664820c2e257481
I0706 14:08:36.011364  8672 plan_fragment_executor.cpp:75] PlanFragmentExecutor::prepare|pthread_id=139637709264640|backend_num=12|instance_id=b2e891a0079a4fdb-a664820c2e257485|query_id=b2e891a0079a4fdb-a664820c2e257481
I0706 14:08:36.018893  8672 vdata_stream_sender.h:176] will use local Exchange, dest_node_id is : 22
I0706 14:08:36.019023  8486 fragment_mgr.cpp:459] PlanFragmentExecutor::_exec_actual|pthread_id=139639219951360|instance_id=b2e891a0079a4fdb-a664820c2e257485|query_id=b2e891a0079a4fdb-a664820c2e257481
I0706 14:08:36.019047  8486 plan_fragment_executor.cpp:233] PlanFragmentExecutor::open, using query memory limit: 8.00 GB|mem_limit=8589934592|instance_id=b2e891a0079a4fdb-a664820c2e257485|query_id=b2e891a0079a4fdb-a664820c2e257481
I0706 14:08:36.027694  8672 plan_fragment_executor.cpp:75] PlanFragmentExecutor::prepare|pthread_id=139637709264640|backend_num=17|instance_id=b2e891a0079a4fdb-a664820c2e257488|query_id=b2e891a0079a4fdb-a664820c2e257481
I0706 14:08:36.029170  8672 vdata_stream_sender.h:176] will use local Exchange, dest_node_id is : 19
I0706 14:08:36.029265  8507 fragment_mgr.cpp:459] PlanFragmentExecutor::_exec_actual|pthread_id=139639043704576|instance_id=b2e891a0079a4fdb-a664820c2e257488|query_id=b2e891a0079a4fdb-a664820c2e257481
I0706 14:08:36.029299  8507 plan_fragment_executor.cpp:233] PlanFragmentExecutor::open, using query memory limit: 8.00 GB|mem_limit=8589934592|instance_id=b2e891a0079a4fdb-a664820c2e257488|query_id=b2e891a0079a4fdb-a664820c2e257481
I0706 14:08:36.130455  8507 plan_fragment_executor.cpp:678] Close() fragment_instance_id=b2e891a0079a4fdb-a664820c2e257488
I0706 14:08:36.166729  8487 plan_fragment_executor.cpp:678] Close() fragment_instance_id=b2e891a0079a4fdb-a664820c2e257490
I0706 14:08:37.913976  8501 plan_fragment_executor.cpp:678] Close() fragment_instance_id=b2e891a0079a4fdb-a664820c2e25748d
I0706 14:08:48.800683  8638 task_worker_pool.cpp:1621] finish report TASK. master host: 192.168.134.32, port: 39020
I0706 14:08:53.708926  8571 load_channel_mgr.cpp:252] cleaning timed out load channels
I0706 14:08:53.708952  8571 load_channel_mgr.cpp:285] load mem consumption(bytes). limit: 107374182400, current: 0, peak: 0
I0706 14:09:00.642282  8909 heartbeat_server.cpp:58] get heartbeat from FE.host:192.168.134.32, port:39020, cluster id:1748834092, counter:49
I0706 14:09:02.801405  8638 task_worker_pool.cpp:1621] finish report TASK. master host: 192.168.134.32, port: 39020
I0706 14:09:07.801204  8640 data_dir.cpp:739] path: /data8/fjyulu/apache-doris-1.0.0-incubating-bin/be/storage total capacity: 9921845387264, available capacity: 3276341891072
I0706 14:09:07.801723  8640 storage_engine.cpp:374] get root path info cost: 0 ms. tablet counter: 461
I0706 14:09:07.802167  8640 task_worker_pool.cpp:1621] finish report DISK. master host: 192.168.134.32, port: 39020
I0706 14:09:13.801931  8638 task_worker_pool.cpp:1621] finish report TASK. master host: 192.168.134.32, port: 39020
I0706 14:09:14.799396  8641 tablet_manager.cpp:882] begin to build all report tablets info
I0706 14:09:14.799453  8641 tablet_manager.cpp:887] find expired transactions for 0 tablets
I0706 14:09:14.801268  8641 tablet_manager.cpp:924] success to build all report tablets info. tablet_count=461
I0706 14:09:14.802610  8641 task_worker_pool.cpp:1621] finish report TABLET. master host: 192.168.134.32, port: 39020
I0706 14:09:26.802498  8638 task_worker_pool.cpp:1621] finish report TASK. master host: 192.168.134.32, port: 39020
I0706 14:09:30.320729 28065 backend_service.cpp:353] get_batch stream_load_record rocksdb successfully. records size: 0, last_stream_load_timestamp: -1
I0706 14:09:39.803200  8638 task_worker_pool.cpp:1621] finish report TASK. master host: 192.168.134.32, port: 39020
I0706 14:09:45.613303  8486 plan_fragment_executor.cpp:678] Close() fragment_instance_id=b2e891a0079a4fdb-a664820c2e257485

I think the log lines containing PlanFragmentExecutor mark the point where I ran the SQL.
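To narrow down which fragment was still running when the BE stopped, one option is to pair each PlanFragmentExecutor::prepare entry with its matching Close() entry. The following is a minimal sketch, assuming the log keeps the exact line format shown above; the function name `unclosed_instances` and the regular expressions are illustrative and not part of Doris. In the excerpt above, instance b2e891a0079a4fdb-a664820c2e257482 has prepare and open entries but no Close() entry, which may or may not be related to the crash.

```python
import re

# Patterns written against the log lines quoted in this issue (an assumption:
# other Doris versions may format these messages differently).
PREPARE_RE = re.compile(
    r"PlanFragmentExecutor::prepare\|.*?\binstance_id=([0-9a-f]+-[0-9a-f]+)"
)
CLOSE_RE = re.compile(r"Close\(\) fragment_instance_id=([0-9a-f]+-[0-9a-f]+)")


def unclosed_instances(lines):
    """Return instance IDs that logged prepare but never logged Close()."""
    prepared = []  # keeps first-seen order
    closed = set()
    for line in lines:
        prepare_match = PREPARE_RE.search(line)
        if prepare_match:
            instance_id = prepare_match.group(1)
            if instance_id not in prepared:
                prepared.append(instance_id)
            continue
        close_match = CLOSE_RE.search(line)
        if close_match:
            closed.add(close_match.group(1))
    return [instance_id for instance_id in prepared if instance_id not in closed]


# Three lines copied from the log above: two prepares, one Close().
sample = [
    "I0706 14:08:35.982132  8697 plan_fragment_executor.cpp:75] PlanFragmentExecutor::prepare|pthread_id=139637499447040|backend_num=1|instance_id=b2e891a0079a4fdb-a664820c2e257482|query_id=b2e891a0079a4fdb-a664820c2e257481",
    "I0706 14:08:36.027694  8672 plan_fragment_executor.cpp:75] PlanFragmentExecutor::prepare|pthread_id=139637709264640|backend_num=17|instance_id=b2e891a0079a4fdb-a664820c2e257488|query_id=b2e891a0079a4fdb-a664820c2e257481",
    "I0706 14:08:36.130455  8507 plan_fragment_executor.cpp:678] Close() fragment_instance_id=b2e891a0079a4fdb-a664820c2e257488",
]
print(unclosed_instances(sample))  # ['b2e891a0079a4fdb-a664820c2e257482']
```

To run it over a whole log file, pass the open file object as `lines`. A missing Close() entry only shows that the fragment had not finished when the log ended; it does not by itself explain why the BE process exited.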

What You Expected?

The BE should not die silently. At the very least, it should report the reason when it dies.

How to Reproduce?

No response

Anything Else?

No response

Are you willing to submit PR?

Code of Conduct

hf200012 commented 2 years ago

Please test TPC-H on 1.1 rc05; 1.0 has not been thoroughly optimized for this test set. You can add me on WeChat: 35926237