Closed by Lordeath 2 years ago
The BE sometimes quits with this log message:
I1025 23:29:03.222785 15293 server.cpp:1100] Server[starrocks::BackendInternalServiceImpl<starrocks::PInternalService>+starrocks::LakeServiceImpl+starrocks::BackendInternalServiceImpl<doris::PBackendService>] is going to quit
I have no idea why.
[root@localhost bin]# java -version
openjdk version "1.8.0_345"
OpenJDK Runtime Environment (build 1.8.0_345-b01)
OpenJDK 64-Bit Server VM (build 25.345-b01, mixed mode)
export JAVA_HOME=/usr/lib/jvm/java-1.8.0-openjdk-1.8.0.345.b01-1.el7_9.x86_64
The BE on a machine I bought from Alibaba Cloud also crashed. Its version is 2.3.0, and be.out shows this:
I1027 07:27:43.205304 30725 rowset_merger.cpp:263] compaction merge finished. tablet=11387 #key=3 algorithm=VERTICAL_COMPACTION column_group_size=38 input(entry=40 rows=23360 del=0 actual=23360 bytes=6.08 MB) output(rows=23360 chunk=44 bytes=4.76 MB) duration: 371ms
I1027 07:27:43.211071 30725 tablet_updates.cpp:1215] commit compaction tablet:11387 version:789.1 rowset:812 #seg:1 #row:23360 size:4.76 MB #pending:0 state_memory:1.49 MB
I1027 07:27:43.211123 19631 tablet_updates.cpp:1241] apply_compaction_commit start tablet:11387 version:789.1 rowset:812
I1027 07:27:43.218067 19631 tablet_updates.cpp:1388] apply_compaction_commit finish tablet:11387 version:789.1 total del/row:0/23360 0% rowset:812 #row:23360 #del:0 #delvec:1 duration:6ms(0/6/0)
I1027 07:27:43.218343 30725 tablet_manager.cpp:672] Found the best tablet to compact. compaction_type=update tablet_id=11269 highest_score=1268949620
I1027 07:27:43.218374 30725 tablet_updates.cpp:1688] update compaction start tablet:11269 version:789 score:1268949632 pick:39/valid:39/all:39 773,774,775,776,777,778,779,780,781,782,783,784,785,786,787,788,789,790,791,792,793,794,795,796,797,798,799,800,801,802,803,804,805,806,807,808,809,810,811 #rows:23494->23494 bytes:5.91 MB->5.91 MB(estimate)
I1027 07:27:43.247026 30723 rowset_merger.cpp:263] compaction merge finished. tablet=11573 #key=3 algorithm=VERTICAL_COMPACTION column_group_size=38 input(entry=39 rows=23307 del=0 actual=23307 bytes=5.95 MB) output(rows=23307 chunk=43 bytes=4.75 MB) duration: 266ms
I1027 07:27:43.250888 30723 tablet_updates.cpp:1215] commit compaction tablet:11573 version:789.1 rowset:812 #seg:1 #row:23307 size:4.75 MB #pending:0 state_memory:1.49 MB
I1027 07:27:43.250921 19631 tablet_updates.cpp:1241] apply_compaction_commit start tablet:11573 version:789.1 rowset:812
I1027 07:27:43.255218 19631 tablet_updates.cpp:1388] apply_compaction_commit finish tablet:11573 version:789.1 total del/row:0/23307 0% rowset:812 #row:23307 #del:0 #delvec:1 duration:5ms(0/5/0)
I1027 07:27:43.255479 30723 tablet_manager.cpp:672] Found the best tablet to compact. compaction_type=update tablet_id=11775 highest_score=1268927983
I1027 07:27:43.255498 30723 tablet_updates.cpp:1688] update compaction start tablet:11775 version:789 score:1268928000 pick:39/valid:39/all:39 773,774,775,776,777,778,779,780,781,782,783,784,785,786,787,788,789,790,791,792,793,794,795,796,797,798,799,800,801,802,803,804,805,806,807,808,809,810,811 #rows:23413->23413 bytes:5.99 MB->5.99 MB(estimate)
src/central_freelist.cc:333] tcmalloc: allocation failed 8192
src/central_freelist.cc:333] tcmalloc: allocation failed 8192
src/central_freelist.cc:333] tcmalloc: allocation failed 8192
src/central_freelist.cc:333] tcmalloc: allocation failed 8192
src/central_freelist.cc:333] tcmalloc: allocation failed 8192
src/central_freelist.cc:333] tcmalloc: allocation failed 8192
src/central_freelist.cc:333] tcmalloc: allocation failed 8192
src/central_freelist.cc:333] tcmalloc: allocation failed 8192
src/central_freelist.cc:333] tcmalloc: allocation failed 8192
src/central_freelist.cc:333] tcmalloc: allocation failed 8192
src/central_freelist.cc:333] tcmalloc: allocation failed 8192
src/central_freelist.cc:333] tcmalloc: allocation failed 8192
src/central_freelist.cc:333] tcmalloc: allocation failed 8192
src/central_freelist.cc:333] tcmalloc: allocation failed 8192
src/central_freelist.cc:333] tcmalloc: allocation failed 8192
src/central_freelist.cc:333] tcmalloc: allocation failed 8192
src/central_freelist.cc:333] tcmalloc: allocation failed 8192
src/central_freelist.cc:333] tcmalloc: allocation failed 8192
src/central_freelist.cc:333] tcmalloc: allocation failed 8192
src/central_freelist.cc:333] tcmalloc: allocation failed 40960
terminate called recursively
terminate called recursively
terminate called recursively
terminate called recursively
terminate called recursively
terminate called recursively
terminate called recursively
terminate called recursively
terminate called recursively
terminate called recursively
terminate called recursively
terminate called recursively
terminate called recursively
terminate called recursively
terminate called recursively
terminate called recursively
terminate called recursively
src/central_freelist.cc:333] tcmalloc: allocation failed 8192
terminate called after throwing an instance of 'terminate called recursively
*** Aborted at 1666826863 (unix time) try "date -d @1666826863" if you are using GNU date ***
St9bad_alloc'
what(): std::bad_allocPC: @ 0x7fc42373c387 __GI_raise
*** SIGABRT (@0x75fa) received by PID 30202 (TID 0x7fc34650b700) from PID 30202; stack trace: ***
@ 0x3fa3ad2 google::(anonymous namespace)::FailureSignalHandler()
@ 0x7fc4241f1630 (unknown)
@ 0x7fc42373c387 __GI_raise
@ 0x7fc42373da78 __GI_abort
@ 0x59a87f2 __gnu_cxx::__verbose_terminate_handler()
@ 0x59a72a6 __cxxabiv1::__terminate()
@ 0x59a7311 std::terminate()
@ 0x59a7464 __cxa_throw
@ 0x1888484 _Znwm.cold
@ 0x1b2ab1e starrocks::vectorized::ChunkHelper::convert_field_to_format_v2()
@ 0x1b2b9c1 starrocks::vectorized::ChunkHelper::convert_schema_to_format_v2()
@ 0x1bb1388 starrocks::vectorized::MemTable::MemTable()
@ 0x3315bb4 starrocks::vectorized::DeltaWriter::_reset_mem_table()
@ 0x33161c0 starrocks::vectorized::DeltaWriter::write()
@ 0x3307943 starrocks::vectorized::AsyncDeltaWriter::_execute()
@ 0x40742ec bthread::ExecutionQueueBase::_execute()
@ 0x40750b8 bthread::ExecutionQueueBase::_execute_tasks()
@ 0x2132549 starrocks::ThreadPool::dispatch_thread()
@ 0x212e0fa starrocks::Thread::supervise_thread()
@ 0x7fc4241e9ea5 start_thread
@ 0x7fc423804b0d __clone
@ 0x0 (unknown)
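The be.out excerpt above shows repeated tcmalloc allocation failures followed by std::bad_alloc and SIGABRT. A minimal sketch for summarizing those indicators from a be.out file follows. The function name summarize_be_out and the regular expressions are hypothetical helpers written against the lines pasted here. They are not part of StarRocks, and other versions may log differently.

```python
import re
from collections import Counter

# Patterns derived from the be.out excerpt in this issue (assumption:
# other builds may format these lines differently).
ALLOC_FAILED = re.compile(r"tcmalloc: allocation failed (\d+)")
SIGNAL = re.compile(r"\*\*\* (SIG[A-Z]+) \(@0x[0-9a-f]+\) received by PID (\d+)")


def summarize_be_out(lines):
    """Count failed allocations by requested size and collect fatal signals."""
    failed_sizes = Counter()
    signals = []
    bad_alloc = False
    for line in lines:
        match = ALLOC_FAILED.search(line)
        if match:
            failed_sizes[int(match.group(1))] += 1
        match = SIGNAL.search(line)
        if match:
            signals.append((match.group(1), int(match.group(2))))
        if "std::bad_alloc" in line:
            bad_alloc = True
    return {
        "failed_sizes": dict(failed_sizes),
        "signals": signals,
        "bad_alloc": bad_alloc,
    }


# A few lines copied from the log above, used as sample input.
sample = [
    "src/central_freelist.cc:333] tcmalloc: allocation failed 8192",
    "src/central_freelist.cc:333] tcmalloc: allocation failed 8192",
    "src/central_freelist.cc:333] tcmalloc: allocation failed 40960",
    "what(): std::bad_allocPC: @ 0x7fc42373c387 __GI_raise",
    "*** SIGABRT (@0x75fa) received by PID 30202 (TID 0x7fc34650b700) from PID 30202; stack trace: ***",
]
summary = summarize_be_out(sample)
print(summary)
```

To scan a real file, pass an open file object instead of the sample list, for example summarize_be_out(open("be.out", errors="replace")).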
query_id:00000000-0000-0000-0000-000000000000, fragment_instance:00000000-0000-0000-0000-000000000000
*** Aborted at 1667285873 (unix time) try "date -d @1667285873" if you are using GNU date ***
PC: @ 0x60d9c38 (unknown)
*** SIGSEGV (@0x7efd59cdc578) received by PID 37551 (TID 0x7efd614f0700) from PID 1506657656; stack trace: ***
@ 0x481e332 (unknown)
@ 0x7efe2f92d630 (unknown)
@ 0x60d9c38 (unknown)
@ 0x60db403 (unknown)
@ 0x60db5b8 (unknown)
@ 0x60dca9b (unknown)
@ 0x6063580 jemalloc_usable_size
@ 0x25c4bf5 free
@ 0x7efe2efb3522 __libc_thread_freeres
@ 0x7efe2f925eb8 start_thread
@ 0x7efe2ef4096d __clone
@ 0x0 (unknown)
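Both crash dumps above print the abort time as a Unix timestamp and suggest date -d to decode it. Where GNU date is not at hand, the same conversion can be sketched in Python. The helper name abort_time is hypothetical, and the UTC+8 offset is an assumption about the server's time zone, chosen because it lines up with the I1027 07:27:43 log lines in the first dump.

```python
from datetime import datetime, timedelta, timezone


def abort_time(unix_seconds, utc_offset_hours=0):
    """Convert the 'Aborted at N (unix time)' value to an aware datetime."""
    tz = timezone(timedelta(hours=utc_offset_hours))
    return datetime.fromtimestamp(unix_seconds, tz=tz)


# Timestamps copied from the two "*** Aborted at ..." lines above.
first = abort_time(1666826863)
second = abort_time(1667285873)
print(first.isoformat())   # 2022-10-26T23:27:43+00:00
print(second.isoformat())  # 2022-11-01T06:57:53+00:00

# Assumed UTC+8 server clock: matches the last log lines before the first crash.
first_local = abort_time(1666826863, utc_offset_hours=8)
print(first_local.isoformat())  # 2022-10-27T07:27:43+08:00
```

The two aborts are about five days apart, and the stack traces show different allocators (tcmalloc in the first, jemalloc in the second), so they come from separate runs.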
This was my own mistake. After I removed the final tailf /opt/flink/flink-1.14.6/log/flink-root-taskexecutor-0-*.log, everything worked fine.
Latest be.INFO:
I found this bug on different computers, so is my shell script wrong?
Steps to reproduce the behavior (Required)
I just run my x.sh:
Expected behavior (Required)
The BE should not shut down.
Real behavior (Required)
StarRocks version (Required)
select current_version()
2.4.0 RELEASE (build c0fa2bb) Built on 2022-10-20 15:08:05 by StarRocks@docker