apache / doris

Apache Doris is an easy-to-use, high performance and unified analytics database.
https://doris.apache.org
Apache License 2.0
12.77k stars 3.29k forks source link

[Bug] org.apache.doris.rpc.RpcException: timeout when waiting for send execution start RPC. Wait(sec): 30, host: #44343

Open lisihan0607 opened 1 day ago

lisihan0607 commented 1 day ago

Search before asking

Version

2.0.2.1

What's Wrong?

正常运行中,偶尔出现超时30s的异常 fe日志 org.apache.doris.rpc.RpcException: timeout when waiting for send execution start RPC. Wait(sec): 30, host: be日志 W1120 13:52:28.695605 2215467 pipeline_fragment_context.cpp:159] PipelineFragmentContext Canceled. reason= W1120 13:52:28.695755 2215349 fragment_mgr.cpp:1014] Do not find the fragment instance id:TUniqueId(hi=4748102679975577237, lo=-6238471276923290687) to cancel W1120 13:52:28.695878 2214751 fragment_mgr.cpp:481] report error status: [CANCELLED] to coordinator: TNetworkAddress(hostname=10.177.16.174, port=9020), query id: 41e4a63b1d064695-a96c7f6d2d827bbd, instance id: 41e4a63b1d064695-a96c7f6d2d827bd8 W1120 13:52:28.696149 2214751 status.h:383] meet error status: [RUNTIME_ERROR]

  1. /root/src/doris-2.0/be/src/common/stack_trace.cpp:302: StackTrace::tryCapture() @ 0x000000000b9e64b7 in /opt/doris-2.0.2.1/be/lib/doris_be
  2. /root/src/doris-2.0/be/src/common/stack_trace.h:0: doris::get_stack_trace[abi:cxx11]() @ 0x000000000b9e4ad5 in /opt/doris-2.0.2.1/be/lib/doris_be
  3. /var/local/ldb-toolchain/bin/../lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/bits/basic_string.h:187: doris::Status doris::Status::Error(int, std::basic_string_view<char, std::char_traits >) @ 0x000000000aeb998a in /opt/doris-2.0.2.1/be/lib/doris_be
  4. /var/local/ldb-toolchain/bin/../lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/bits/basic_string.h:187: doris::Status::create(doris::TStatus const&) @ 0x000000000b6c4fee in /opt/doris-2.0.2.1/be/lib/doris_be
  5. /root/src/doris-2.0/be/src/common/status.h:348: doris::FragmentMgr::coordinator_callback(doris::ReportStatusRequest const&) @ 0x000000000b7e3f6c in /opt/doris-2.0.2.1/be/lib/doris_be
  6. /var/local/ldb-toolchain/bin/../lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/bits/std_function.h:244: doris::pipeline::PipelineFragmentContext::send_report(bool) @ 0x0000000012481b6f in /opt/doris-2.0.2.1/be/lib/doris_be
  7. /root/src/doris-2.0/be/src/pipeline/task_scheduler.cpp:264: doris::pipeline::TaskScheduler::_do_work(unsigned long) @ 0x00000000124cedaf in /opt/doris-2.0.2.1/be/lib/doris_be
  8. /root/src/doris-2.0/be/src/util/threadpool.cpp:0: doris::ThreadPool::dispatch_thread() @ 0x000000000ba2213c in /opt/doris-2.0.2.1/be/lib/doris_be
  9. /var/local/ldb-toolchain/bin/../usr/include/pthread.h:562: doris::Thread::supervise_thread(void*) @ 0x000000000ba1818a in /opt/doris-2.0.2.1/be/lib/doris_be
  10. ? @ 0x00007f922837647a in ?
  11. ? @ 0x00007f92283f90e0 in ?

What You Expected?

出现该问题的原因?

How to Reproduce?

No response

Anything Else?

No response

Are you willing to submit PR?

Code of Conduct