apache / doris

Apache Doris is an easy-to-use, high performance and unified analytics database.
https://doris.apache.org
Apache License 2.0

[Bug] BE crashes with bvar::detail::SamplerCollector::sampling_thread #18338

Open GronwD opened 1 year ago

GronwD commented 1 year ago

Version

1.2.0

What's Wrong?

One of our BEs could not be started, so we removed it. After that, the other BEs frequently fail with the error below. Please kindly help.

Query id: 0-0
Aborted at 1680507855 (unix time) try "date -d @1680507855" if you are using GNU date
Current BE git commitID: Unknown
SIGSEGV address not mapped to object (@0x35) received by PID 2863096 (TID 0x7f9f169ff700) from PID 53; stack trace:
0# doris::signal::(anonymous namespace)::FailureSignalHandler(int, siginfo_t, void) at /root/doris/be/src/common/signal_handler.h:420
1# 0x00007F9F13DFF2B7 in /usr/lib/jvm/java-8-openjdk-amd64/jre/lib/amd64/server/libjvm.so
2# JVM_handle_linux_signal in /usr/lib/jvm/java-8-openjdk-amd64/jre/lib/amd64/server/libjvm.so
3# 0x00007F9F13DF80AC in /usr/lib/jvm/java-8-openjdk-amd64/jre/lib/amd64/server/libjvm.so
4# 0x00007F9F173ED090 in /lib/x86_64-linux-gnu/libc.so.6
5# bvar::Reducer<long, bvar::detail::AddTo, bvar::detail::MinusFrom >::SeriesSampler::take_sample() in /root/apache-doris-be-1.2.0-bin-x86_64/lib/doris_be
6# bvar::detail::SamplerCollector::run() in /root/apache-doris-be-1.2.0-bin-x86_64/lib/doris_be
7# bvar::detail::SamplerCollector::sampling_thread(void*) in /root/apache-doris-be-1.2.0-bin-x86_64/lib/doris_be
8# start_thread in /lib/x86_64-linux-gnu/libpthread.so.0
9# __clone in /lib/x86_64-linux-gnu/libc.so.6
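
For anyone correlating this crash with other logs, the "Aborted at" value in the trace above can be decoded without GNU date. Below is a minimal Python sketch; the helper name decode_abort_time is hypothetical and not part of Doris. It also shows that the "from PID 53" figure equals the fault address 0x35 in decimal, which suggests (an assumption, not confirmed in this report) that the handler printed the address field as a sender PID rather than another process having sent the signal.

```python
from datetime import datetime, timezone


def decode_abort_time(unix_seconds: int) -> datetime:
    """Convert the 'Aborted at <n> (unix time)' value to a UTC datetime.

    Hypothetical helper for reading the crash log; not a Doris API.
    """
    return datetime.fromtimestamp(unix_seconds, tz=timezone.utc)


if __name__ == "__main__":
    # Timestamp copied from the crash log in this report.
    print(decode_abort_time(1680507855).isoformat())

    # The fault address (@0x35) and the reported sender PID (53) are the
    # same number, so the PID is probably not a real process ID.
    print(int("0x35", 16) == 53)
```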

What You Expected?

BE should not exit

How to Reproduce?

It appears that when we run certain queries on big tables, this exception shows up in be.out.

Anything Else?

No response

GronwD commented 1 year ago

After investigation, we found that this happens when we query a very large table without the partition key in the WHERE conditions. Hope this helps. The table is 3TB+ with around 60 partitions.