apache / doris

Apache Doris is an easy-to-use, high performance and unified analytics database.
https://doris.apache.org
Apache License 2.0

[Bug] [CANCELLED]missed_versions is empty, spec_version 129344, max_version 129368, tablet_id 116827409 #42021

Open zh0122 opened 1 month ago

zh0122 commented 1 month ago

Version

2.1.6

What's Wrong?

Executing the SQL statement returns an error:

[ERROR] 2024-10-17 12:37:46.013  - [taskAppId=TASK-908-19526539-86206332]:[265] - execute sql error: errCode = 2, detailMessage = (10.181.26.31)[CANCELLED]missed_versions is empty, spec_version 129344, max_version 129368, tablet_id 116827409
[ERROR] 2024-10-17 12:37:46.013  - [taskAppId=TASK-908-19526539-86206332]:[167] - sql task error: java.sql.SQLException: errCode = 2, detailMessage = (10.181.26.31)[CANCELLED]missed_versions is empty, spec_version 129344, max_version 129368, tablet_id 116827409
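For anyone triaging this, here is a minimal sketch that pulls the three numeric fields out of the error text so they can be compared or searched for in the BE logs. The function name `parse_missed_versions` and the regular expression are my own. The field layout is inferred only from the message above and is not a documented, stable format.

```python
import re

# Field layout taken from the error text in this report (an assumption,
# not a documented format).
PATTERN = re.compile(
    r"spec_version (?P<spec_version>\d+), "
    r"max_version (?P<max_version>\d+), "
    r"tablet_id (?P<tablet_id>\d+)"
)


def parse_missed_versions(message):
    """Return the three numeric fields as ints, or None if the text does not match."""
    match = PATTERN.search(message)
    if match is None:
        return None
    return {name: int(value) for name, value in match.groupdict().items()}


message = (
    "errCode = 2, detailMessage = (10.181.26.31)[CANCELLED]missed_versions is empty, "
    "spec_version 129344, max_version 129368, tablet_id 116827409"
)
fields = parse_missed_versions(message)

# How far the tablet's newest version is ahead of the version the query asked for.
version_gap = fields["max_version"] - fields["spec_version"]
```

Here the query asked for version 129344 while the tablet had already reached 129368.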

Log from be.INFO:

W20241017 12:37:45.995843 147271 status.h:413] meet error status: [INTERNAL_ERROR]fail to find path in version_graph. spec_version: 0-129344

        0#  doris::VersionGraph::capture_consistent_versions(doris::Version const&, std::vector<doris::Version, std::allocator<doris::Version> >*) const at /home/zcp/repo_center/doris_release/doris/be/src/olap/version_graph.cpp:627
        1#  doris::TimestampedVersionTracker::capture_consistent_versions(doris::Version const&, std::vector<doris::Version, std::allocator<doris::Version> >*) const at /home/zcp/repo_center/doris_release/doris/be/src/olap/version_graph.cpp:330
        2#  doris::Tablet::capture_consistent_versions(doris::Version const&, std::vector<doris::Version, std::allocator<doris::Version> >*, bool, bool) const at /home/zcp/repo_center/doris_release/doris/be/src/common/status.h:491
        3#  doris::Tablet::capture_rs_readers(doris::Version const&, std::vector<doris::RowSetSplits, std::allocator<doris::RowSetSplits> >*, bool) const at /home/zcp/repo_center/doris_release/doris/be/src/common/status.h:491
        4#  doris::vectorized::NewOlapScanner::init() at /home/zcp/repo_center/doris_release/doris/be/src/common/status.h:491
        5#  doris::vectorized::ScannerScheduler::_scanner_scan(std::shared_ptr<doris::vectorized::ScannerContext>, std::shared_ptr<doris::vectorized::ScanTask>) at /home/zcp/repo_center/doris_release/doris/be/src/common/status.h:378
        6#  std::_Function_handler<void (), doris::vectorized::ScannerScheduler::submit(std::shared_ptr<doris::vectorized::ScannerContext>, std::shared_ptr<doris::vectorized::ScanTask>)::$_1::operator()() const::{lambda()#1}>::_M_invoke(std::_Any_data const&) at /var/local/ldb-toolchain/bin/../lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/bits/shared_ptr_base.h:701
        7#  doris::ThreadPool::dispatch_thread() at /home/zcp/repo_center/doris_release/doris/be/src/util/threadpool.cpp:0
        8#  doris::Thread::supervise_thread(void*) at /var/local/ldb-toolchain/bin/../usr/include/pthread.h:562
        9#  start_thread
        10# clone
W20241017 12:37:45.995951 147271 tablet.cpp:952] tablet:116827409, version already has been merged. spec_version: [0-129344], max_version: [129368-129368]
W20241017 12:37:46.005101 147271 status.h:413] meet error status: [E-230]missed_versions is empty, spec_version 129344, max_version 129368, tablet_id 116827409

        0#  doris::Tablet::capture_consistent_versions(doris::Version const&, std::vector<doris::Version, std::allocator<doris::Version> >*, bool, bool) const at /home/zcp/repo_center/doris_release/doris/be/src/common/status.h:378
        1#  doris::Tablet::capture_rs_readers(doris::Version const&, std::vector<doris::RowSetSplits, std::allocator<doris::RowSetSplits> >*, bool) const at /home/zcp/repo_center/doris_release/doris/be/src/common/status.h:491
        2#  doris::vectorized::NewOlapScanner::init() at /home/zcp/repo_center/doris_release/doris/be/src/common/status.h:491
        3#  doris::vectorized::ScannerScheduler::_scanner_scan(std::shared_ptr<doris::vectorized::ScannerContext>, std::shared_ptr<doris::vectorized::ScanTask>) at /home/zcp/repo_center/doris_release/doris/be/src/common/status.h:378
        4#  std::_Function_handler<void (), doris::vectorized::ScannerScheduler::submit(std::shared_ptr<doris::vectorized::ScannerContext>, std::shared_ptr<doris::vectorized::ScanTask>)::$_1::operator()() const::{lambda()#1}>::_M_invoke(std::_Any_data const&) at /var/local/ldb-toolchain/bin/../lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/bits/shared_ptr_base.h:701
        5#  doris::ThreadPool::dispatch_thread() at /home/zcp/repo_center/doris_release/doris/be/src/util/threadpool.cpp:0
        6#  doris::Thread::supervise_thread(void*) at /var/local/ldb-toolchain/bin/../usr/include/pthread.h:562
        7#  start_thread
        8#  clone
W20241017 12:37:46.005203 147271 new_olap_scanner.cpp:191] fail to init reader.res=[E-230]missed_versions is empty, spec_version 129344, max_version 129368, tablet_id 116827409

        0#  doris::Tablet::capture_consistent_versions(doris::Version const&, std::vector<doris::Version, std::allocator<doris::Version> >*, bool, bool) const at /home/zcp/repo_center/doris_release/doris/be/src/common/status.h:378
        1#  doris::Tablet::capture_rs_readers(doris::Version const&, std::vector<doris::RowSetSplits, std::allocator<doris::RowSetSplits> >*, bool) const at /home/zcp/repo_center/doris_release/doris/be/src/common/status.h:491
        2#  doris::vectorized::NewOlapScanner::init() at /home/zcp/repo_center/doris_release/doris/be/src/common/status.h:491
        3#  doris::vectorized::ScannerScheduler::_scanner_scan(std::shared_ptr<doris::vectorized::ScannerContext>, std::shared_ptr<doris::vectorized::ScanTask>) at /home/zcp/repo_center/doris_release/doris/be/src/common/status.h:378
        4#  std::_Function_handler<void (), doris::vectorized::ScannerScheduler::submit(std::shared_ptr<doris::vectorized::ScannerContext>, std::shared_ptr<doris::vectorized::ScanTask>)::$_1::operator()() const::{lambda()#1}>::_M_invoke(std::_Any_data const&) at /var/local/ldb-toolchain/bin/../lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/bits/shared_ptr_base.h:701
        5#  doris::ThreadPool::dispatch_thread() at /home/zcp/repo_center/doris_release/doris/be/src/util/threadpool.cpp:0
        6#  doris::Thread::supervise_thread(void*) at /var/local/ldb-toolchain/bin/../usr/include/pthread.h:562
        7#  start_thread
        8#  clone
W20241017 12:37:46.005328 147253 task_scheduler.cpp:361] Pipeline task failed. query_id: 7d8ad29c3464420-9fa214771b21fce5|0-0 reason: [E-230]missed_versions is empty, spec_version 129344, max_version 129368, tablet_id 116827409

        0#  doris::Tablet::capture_consistent_versions(doris::Version const&, std::vector<doris::Version, std::allocator<doris::Version> >*, bool, bool) const at /home/zcp/repo_center/doris_release/doris/be/src/common/status.h:378
        1#  doris::Tablet::capture_rs_readers(doris::Version const&, std::vector<doris::RowSetSplits, std::allocator<doris::RowSetSplits> >*, bool) const at /home/zcp/repo_center/doris_release/doris/be/src/common/status.h:491
        2#  doris::vectorized::NewOlapScanner::init() at /home/zcp/repo_center/doris_release/doris/be/src/common/status.h:491
        3#  doris::vectorized::ScannerScheduler::_scanner_scan(std::shared_ptr<doris::vectorized::ScannerContext>, std::shared_ptr<doris::vectorized::ScanTask>) at /home/zcp/repo_center/doris_release/doris/be/src/common/status.h:378
        4#  std::_Function_handler<void (), doris::vectorized::ScannerScheduler::submit(std::shared_ptr<doris::vectorized::ScannerContext>, std::shared_ptr<doris::vectorized::ScanTask>)::$_1::operator()() const::{lambda()#1}>::_M_invoke(std::_Any_data const&) at /var/local/ldb-toolchain/bin/../lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/bits/shared_ptr_base.h:701
        5#  doris::ThreadPool::dispatch_thread() at /home/zcp/repo_center/doris_release/doris/be/src/util/threadpool.cpp:0
        6#  doris::Thread::supervise_thread(void*) at /var/local/ldb-toolchain/bin/../usr/include/pthread.h:562
        7#  start_thread
        8#  clone
I20241017 12:37:46.005442 147253 pipeline_x_fragment_context.cpp:141] PipelineXFragmentContext::cancel|query_id=7d8ad29c3464420-9fa214771b21fce5|fragment_id=2|reason=3|error message=missed_versions is empty, spec_version 129344, max_version 129368, tablet_id 116827409
I20241017 12:37:46.005458 147253 pipeline_x_fragment_context.cpp:141] PipelineXFragmentContext::cancel|query_id=7d8ad29c3464420-9fa214771b21fce5|fragment_id=4|reason=3|error message=missed_versions is empty, spec_version 129344, max_version 129368, tablet_id 116827409
W20241017 12:37:46.005467 147253 pipeline_x_fragment_context.cpp:162] PipelineXFragmentContext cancel instance: 7d8ad29c3464420-9fa214771b21fe73
W20241017 12:37:46.005494 147253 pipeline_x_fragment_context.cpp:162] PipelineXFragmentContext cancel instance: 7d8ad29c3464420-9fa214771b21fce7
W20241017 12:37:46.005506 147253 pipeline_x_fragment_context.cpp:162] PipelineXFragmentContext cancel instance: 7d8ad29c3464420-9fa214771b21fce8
W20241017 12:37:46.005515 147253 pipeline_x_fragment_context.cpp:162] PipelineXFragmentContext cancel instance: 7d8ad29c3464420-9fa214771b21fce9
W20241017 12:37:46.005525 147253 pipeline_x_fragment_context.cpp:162] PipelineXFragmentContext cancel instance: 7d8ad29c3464420-9fa214771b21fcea
W20241017 12:37:46.005535 147253 pipeline_x_fragment_context.cpp:162] PipelineXFragmentContext cancel instance: 7d8ad29c3464420-9fa214771b21fceb
W20241017 12:37:46.005548 147253 pipeline_x_fragment_context.cpp:162] PipelineXFragmentContext cancel instance: 7d8ad29c3464420-9fa214771b21fcec
W20241017 12:37:46.005559 147253 pipeline_x_fragment_context.cpp:162] PipelineXFragmentContext cancel instance: 7d8ad29c3464420-9fa214771b21fced
W20241017 12:37:46.005587 147346 scanner_scheduler.cpp:272] Scan thread read VScanner failed: [CANCELLED]cancelled
W20241017 12:37:46.005574 147335 scanner_scheduler.cpp:272] Scan thread read VScanner failed: [CANCELLED]cancelled
I20241017 12:37:46.005713 147251 fragment_mgr.cpp:715] Removing query 7d8ad29c3464420-9fa214771b21fce5 instance 7d8ad29c3464420-9fa214771b21fe73, all done? false
W20241017 12:37:46.005760 188611 fragment_mgr.cpp:538] report error status: missed_versions is empty, spec_version 129344, max_version 129368, tablet_id 116827409 to coordinator: TNetworkAddress(hostname=10.181.26.34, port=9020), query id: 7d8ad29c3464420-9fa214771b21fce5, instance id: 0-0
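The log above reports "fail to find path in version_graph" followed by "version already has been merged". The toy model below is one way to picture how those two messages can go together. It is not the Doris implementation: the function name is borrowed from the stack trace only for readability, and the rowset ranges are hypothetical small numbers. It assumes a read at version `spec_end` needs a chain of rowset ranges that covers `0..spec_end` exactly.

```python
def capture_consistent_versions(rowsets, spec_end):
    """Toy model: find a chain of (start, end) ranges covering 0..spec_end exactly.

    Returns the chain as a list of tuples, or None when no such chain exists.
    """
    # Index the available rowset ranges by their start version.
    by_start = {}
    for start, end in rowsets:
        by_start.setdefault(start, []).append(end)

    def search(next_start):
        if next_start == spec_end + 1:
            return []  # the chain ends exactly at spec_end
        # Try wider ranges first, skipping any that overshoot the requested end.
        for end in sorted(by_start.get(next_start, []), reverse=True):
            if end > spec_end:
                continue
            rest = search(end + 1)
            if rest is not None:
                return [(next_start, end)] + rest
        return None

    return search(0)


# Hypothetical tablet before compaction: versions 6, 7 and 8 are separate rowsets.
before = [(0, 5), (6, 6), (7, 7), (8, 8)]
# Same tablet after versions 6..8 were merged into one rowset.
after = [(0, 5), (6, 8)]

path_before = capture_consistent_versions(before, 7)
path_after = capture_consistent_versions(after, 7)
path_latest = capture_consistent_versions(after, 8)
```

In this model a read at version 7 succeeds before the merge and fails after it, while a read at the newest version still succeeds. That matches the shape of the report, where spec_version 129344 is older than max_version 129368.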

What You Expected?

The SQL statement executes without error.

How to Reproduce?

No response

Anything Else?

No response

jiyulongxu commented 3 weeks ago

I ran into this problem too. How should it be handled?