Open shamanthchandra-yb opened 9 months ago
Jira Link: DB-8169
We were able to 2 times (20% of time with original workload of CQL)
As per discussion, memtracker for CDC records needs to be taken a look. As discussed here https://yugabyte.slack.com/archives/C03H4D4EVC6/p1695806406849879
Discussed with @suranjan, that the universe shared already had memtracker cleaned up. So, we need to analyze fresh run. To track this, we can track this with ticket.
thread #1, name = 'yb-tserver', stop reason = signal SIGABRT * frame #0: 0x00007f3104d460a7 libc.so.6`__GI_raise(sig=6) at raise.c:54 frame #1: 0x00007f3104d474aa libc.so.6`__GI_abort at abort.c:89 frame #2: 0x000055b726a410c8 yb-tserver`tcmalloc::tcmalloc_internal::Crash(tcmalloc::tcmalloc_internal::CrashMode, char const*, int, tcmalloc::tcmalloc_internal::LogItem, tcmalloc::tcmalloc_internal::LogItem, tcmalloc::tcmalloc_internal::LogItem, tcmalloc::tcmalloc_internal::LogItem, tcmalloc::tcmalloc_internal::LogItem, tcmalloc::tcmalloc_internal::LogItem) + 312 frame #3: 0x000055b726a4703b yb-tserver`tcmalloc::tcmalloc_internal::CppOomPolicy::handle_oom(unsigned long) + 123 frame #4: 0x000055b726a068aa yb-tserver`void* slow_alloc<tcmalloc::tcmalloc_internal::TCMallocPolicy<tcmalloc::tcmalloc_internal::CppOomPolicy, tcmalloc::tcmalloc_internal::DefaultAlignPolicy, tcmalloc::tcmalloc_internal::AllocationAccessHotPolicy, tcmalloc::tcmalloc_internal::InvokeHooksPolicy, tcmalloc::tcmalloc_internal::LocalNumaPartitionPolicy>, std::nullptr_t>(tcmalloc::tcmalloc_internal::TCMallocPolicy<tcmalloc::tcmalloc_internal::CppOomPolicy, tcmalloc::tcmalloc_internal::DefaultAlignPolicy, tcmalloc::tcmalloc_internal::AllocationAccessHotPolicy, tcmalloc::tcmalloc_internal::InvokeHooksPolicy, tcmalloc::tcmalloc_internal::LocalNumaPartitionPolicy>, unsigned long, std::nullptr_t) + 794 frame #5: 0x000055b726a0639e yb-tserver`TCMallocInternalNew + 414 frame #6: 0x000055b724984d65 yb-tserver`std::__1::basic_string<char, std::__1::char_traits<char>, std::__1::allocator<char>>::push_back(char) + 165 frame #7: 0x000055b72496f5dd yb-tserver`std::__1::basic_stringbuf<char, std::__1::char_traits<char>, std::__1::allocator<char>>::overflow(int) + 109 frame #8: 0x000055b72498afcf yb-tserver`std::__1::basic_ostream<char, std::__1::char_traits<char>>::put(char) + 111 frame #9: 0x000055b72615572e yb-tserver`rapidjson::Writer<yb::UTF8StringStreamBuffer, rapidjson::UTF8<char>, rapidjson::UTF8<char>, rapidjson::CrtAllocator, 0u>::WriteString(this=0x000006adb4704670, str="[e1a49ff3-001d-4bfb-8897-8893eeb75df7:182854:S4O7aTT60334R07PTP0??a!0??a6SSP!8RT!T5P8bUU71445S18QUQ1aab?1aab7TTQ?9SU?U6Q9cVV82556T29RVR2bbca2bbc8UURa!TVa:seed, e1a49ff3-001d-4bfb-8897-8893eeb75df7:val:182854:seed:S4O7aTT60334R07PTP0??a!0??a6SSP!8RT!T5P8bUU71445S18QUQ1aab?1aab7TTQ?9SU?U6Q9cVV82556T29RVR2bbca2bbc8UURa!TVa, e1a49ff3-001d-4bfb-8897-8893eeb75df7:val:182854:seed:S4O7aTT60334R07PTP0??a!0??a6SSP!8RT!T5P8bUU71445S18QUQ1aab?1aab7TTQ?9SU?U6Q9cVV82556T29RVR2bbca2bbc8UURa!TVa, e1a49ff3-001d-4bfb-8897-8893eeb75df7:val:182854:seed:S4O7aTT60334R07PTP0??a!0??a6SSP!8RT!T5P8bUU71445S18QUQ1aab?1aab7TTQ?9SU?U6Q9cVV82556T29RVR2bbca2bbc8UURa!TVa, e1a49ff3-001d-4bfb-8897-8893eeb75df7:val:182854:seed:S4O7aTT60334R07PTP0??a!0??a6SSP!8RT!T5P8bUU71445S18QUQ1aab?1aab7TTQ?9SU?U6Q9cVV82556T29RVR2bbca2bbc8UURa!TVa, e1a49ff3-001d-4bfb-8897-8893eeb75df7:val:182854:seed:S4O7aTT60334R07PTP0??a!0??a6SSP!8RT!T5P8bUU71445S18QUQ1aab?1aab7TTQ?9SU?U6Q9cVV82556T29RVR2bbca2bbc8UURa!TVa, e1a49ff3-001d-4bfb-8897-8893eeb75df7:val:182854:s"..., length=4096) at writer.h:0 frame #10: 0x000055b726156dbe yb-tserver`yb::JsonWriter::Protobuf(google::protobuf::Message const&) [inlined] yb::JsonWriter::String(this=0x00007f3031dcc910, str="[e1a49ff3-001d-4bfb-8897-8893eeb75df7:182854:S4O7aTT60334R07PTP0??a!0??a6SSP!8RT!T5P8bUU71445S18QUQ1aab?1aab7TTQ?9SU?U6Q9cVV82556T29RVR2bbca2bbc8UURa!TVa:seed, e1a49ff3-001d-4bfb-8897-8893eeb75df7:val:182854:seed:S4O7aTT60334R07PTP0??a!0??a6SSP!8RT!T5P8bUU71445S18QUQ1aab?1aab7TTQ?9SU?U6Q9cVV82556T29RVR2bbca2bbc8UURa!TVa, e1a49ff3-001d-4bfb-8897-8893eeb75df7:val:182854:seed:S4O7aTT60334R07PTP0??a!0??a6SSP!8RT!T5P8bUU71445S18QUQ1aab?1aab7TTQ?9SU?U6Q9cVV82556T29RVR2bbca2bbc8UURa!TVa, e1a49ff3-001d-4bfb-8897-8893eeb75df7:val:182854:seed:S4O7aTT60334R07PTP0??a!0??a6SSP!8RT!T5P8bUU71445S18QUQ1aab?1aab7TTQ?9SU?U6Q9cVV82556T29RVR2bbca2bbc8UURa!TVa, e1a49ff3-001d-4bfb-8897-8893eeb75df7:val:182854:seed:S4O7aTT60334R07PTP0??a!0??a6SSP!8RT!T5P8bUU71445S18QUQ1aab?1aab7TTQ?9SU?U6Q9cVV82556T29RVR2bbca2bbc8UURa!TVa, e1a49ff3-001d-4bfb-8897-8893eeb75df7:val:182854:seed:S4O7aTT60334R07PTP0??a!0??a6SSP!8RT!T5P8bUU71445S18QUQ1aab?1aab7TTQ?9SU?U6Q9cVV82556T29RVR2bbca2bbc8UURa!TVa, e1a49ff3-001d-4bfb-8897-8893eeb75df7:val:182854:s"...) at jsonwriter.cc:200:53 frame #11: 0x000055b726156db0 yb-tserver`yb::JsonWriter::Protobuf(google::protobuf::Message const&) [inlined] yb::JsonWriter::ProtobufField(this=0x00007f3031dcc910, pb=0x000006acbcbc4d00, field=0x000006acbe473b70) at jsonwriter.cc:285:7 frame #12: 0x000055b726156bcc yb-tserver`yb::JsonWriter::Protobuf(this=0x00007f3031dcc910, pb=0x000006acbcbc4d00) at jsonwriter.cc:251:7 frame #13: 0x000055b726156a20 yb-tserver`yb::JsonWriter::ProtobufRepeatedField(this=0x00007f3031dcc910, pb=0x000006ad4c93f1c0, field=0x000006acb4daa098, index=149) at jsonwriter.cc:326:7 frame #14: 0x000055b726156bc2 yb-tserver`yb::JsonWriter::Protobuf(this=0x00007f3031dcc910, pb=0x000006ad4c93f1c0) at jsonwriter.cc:247:9 frame #15: 0x000055b726156df6 yb-tserver`yb::JsonWriter::Protobuf(google::protobuf::Message const&) [inlined] yb::JsonWriter::ProtobufField(this=0x00007f3031dcc910, pb=0x000006ad96913310, field=0x000006aca79fcbf8) at jsonwriter.cc:288:7 frame #16: 0x000055b726156bcc yb-tserver`yb::JsonWriter::Protobuf(this=0x00007f3031dcc910, pb=0x000006ad96913310) at jsonwriter.cc:251:7 frame #17: 0x000055b726156a20 yb-tserver`yb::JsonWriter::ProtobufRepeatedField(this=0x00007f3031dcc910, pb=0x000006ad41c3c300, field=0x000006acbf72e2f8, index=4) at jsonwriter.cc:326:7 frame #18: 0x000055b726156bc2 yb-tserver`yb::JsonWriter::Protobuf(this=0x00007f3031dcc910, pb=0x000006ad41c3c300) at jsonwriter.cc:247:9 frame #19: 0x000055b726156a20 yb-tserver`yb::JsonWriter::ProtobufRepeatedField(this=0x00007f3031dcc910, pb=0x00007f3031dcc860, field=0x000006acb81b6be0, index=11) at jsonwriter.cc:326:7 frame #20: 0x000055b726156bc2 yb-tserver`yb::JsonWriter::Protobuf(this=0x00007f3031dcc910, pb=0x00007f3031dcc860) at jsonwriter.cc:247:9 frame #21: 0x000055b725abc20b yb-tserver`yb::(anonymous namespace)::RpczPathHandler(messenger=<unavailable>, req=<unavailable>, resp=0x00007f3031dcc960) at rpcz-path-handler.cc:77:10 frame #22: 0x000055b725add408 yb-tserver`yb::Webserver::Impl::BeginRequestCallbackStatic(sq_connection*) [inlined] std::__1::__function::__value_func<void (yb::WebCallbackRegistry::WebRequest const&, yb::WebCallbackRegistry::WebResponse*)>::operator(this=0x000006acbe8d0380, __args=0x00007f3031dcea88, __args=0x00007f3031dce970)[abi:v160006](yb::WebCallbackRegistry::WebRequest const&, yb::WebCallbackRegistry::WebResponse*&&) const at function.h:510:16 frame #23: 0x000055b725add3ee yb-tserver`yb::Webserver::Impl::BeginRequestCallbackStatic(sq_connection*) [inlined] std::__1::function<void (yb::WebCallbackRegistry::WebRequest const&, yb::WebCallbackRegistry::WebResponse*)>::operator(this=0x000006acbe8d0380, __arg=0x00007f3031dcea88, __arg=0x00007f3031dcc960)(yb::WebCallbackRegistry::WebRequest const&, yb::WebCallbackRegistry::WebResponse*) const at function.h:1156:12 frame #24: 0x000055b725add3ee yb-tserver`yb::Webserver::Impl::BeginRequestCallbackStatic(sq_connection*) at webserver.cc:625:5 frame #25: 0x000055b725add0d7 yb-tserver`yb::Webserver::Impl::BeginRequestCallbackStatic(sq_connection*) [inlined] yb::Webserver::Impl::BeginRequestCallback(this=0x000006acbefe5400, connection=0x000000cb7827a000, request_info=0x000000cb7827a000) at webserver.cc:557:10 frame #26: 0x000055b725adc67b yb-tserver`yb::Webserver::Impl::BeginRequestCallbackStatic(connection=0x000000cb7827a000) at webserver.cc:532:20 frame #27: 0x000055b725ae3a87 yb-tserver`process_new_connection + 6119 frame #28: 0x000055b725ae213a yb-tserver`worker_thread + 906 frame #29: 0x00007f3104afc694 libpthread.so.0`start_thread(arg=0x00007f3031dde700) at pthread_create.c:333 frame #30: 0x00007f3104df941d libc.so.6`__clone at clone.S:109
NA
2.19.3.0-b136
Duplicate of https://github.com/yugabyte/yugabyte-db/issues/20268
Jira Link: DB-8169
Description
We were able to 2 times (20% of time with original workload of CQL)
As per discussion, memtracker for CDC records needs to be taken a look. As discussed here https://yugabyte.slack.com/archives/C03H4D4EVC6/p1695806406849879
Discussed with @suranjan, that the universe shared already had memtracker cleaned up. So, we need to analyze fresh run. To track this, we can track this with ticket.
Source connector version
NA
Connector configuration
NA
YugabyteDB version
2.19.3.0-b136
Warning: Please confirm that this issue does not contain any sensitive information