apache / doris

Apache Doris is an easy-to-use, high performance and unified analytics database.
https://doris.apache.org
Apache License 2.0

[Bug] failed to migrate storage medium error=[NOT_IMPLEMENTED_ERROR] #26239

Open lordk911 opened 10 months ago

lordk911 commented 10 months ago

Version

2.0.2

What's Wrong?

The FE master and BE log files keep printing the following warning messages:

fe: WARN (thrift-server-pool-48167|234886) [MasterImpl.finishTask():93] finish task reports bad. request: TFinishTaskRequest(backend:TBackend(host:10.x.x.x, be_port:9060, http_port:8040), task_type:STORAGE_MEDIUM_MIGRATE, signature:860374, task_status:TStatus(status_code:NOT_IMPLEMENTED_ERROR, error_msgs:[(10.x.x.x)[NOT_IMPLEMENTED_ERROR]cannot copy remote files, rowset_id=0200000000235e8ed445d151ca1906ec460909edb10d9a85]))

be: `W1101 18:26:46.012589 15320 task_worker_pool.cpp:1925] failed to migrate storage medium|signature=861366|tablet_id=861366|error=[NOT_IMPLEMENTED_ERROR]cannot copy remote files, rowset_id=0200000000085e76aa4db16eeb1ccf54bd036b3d33669db2

  1. /root/src/doris-2.0/be/src/common/stack_trace.cpp:302: StackTrace::tryCapture() @ 0x000000000ba1f197 in /data/soft/doris/doris-2.0.2/be/lib/doris_be
  2. /root/src/doris-2.0/be/src/common/stack_trace.h:0: doris::get_stack_trace[abi:cxx11]() @ 0x000000000ba1d72d in /data/soft/doris/doris-2.0.2/be/lib/doris_be
  3. /var/local/ldb-toolchain/bin/../lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/bits/unique_ptr.h:173: doris::Status doris::Status::Error<true, std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > >(int, std::basic_string_view<char, std::char_traits<char> >, std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> >&&) @ 0x000000000ae806e2 in /data/soft/doris/doris-2.0.2/be/lib/doris_be
  4. /var/local/ldb-toolchain/bin/../lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/bits/basic_string.h:187: doris::BetaRowset::copy_files_to(std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > const&, doris::RowsetId const&) @ 0x000000000b4c8271 in /data/soft/doris/doris-2.0.2/be/lib/doris_be
  5. /root/src/doris-2.0/be/src/common/status.h:432: doris::EngineStorageMigrationTask::_migrate() @ 0x000000000b7a4c7e in /data/soft/doris/doris-2.0.2/be/lib/doris_be
  6. /root/src/doris-2.0/be/src/olap/task/engine_storage_migration_task.cpp:60: doris::EngineStorageMigrationTask::execute() @ 0x000000000b7a473e in /data/soft/doris/doris-2.0.2/be/lib/doris_be
  7. /root/src/doris-2.0/be/src/common/status.h:432: doris::StorageEngine::execute_task(doris::EngineTask*) @ 0x000000000b64c739 in /data/soft/doris/doris-2.0.2/be/lib/doris_be
  8. /root/src/doris-2.0/be/src/common/status.h:348: doris::StorageMediumMigrateTaskPool::_storage_medium_migrate_worker_thread_callback() @ 0x000000000b6988d7 in /data/soft/doris/doris-2.0.2/be/lib/doris_be
  9. /root/src/doris-2.0/be/src/util/threadpool.cpp:0: doris::ThreadPool::dispatch_thread() @ 0x000000000ba5bdaf in /data/soft/doris/doris-2.0.2/be/lib/doris_be
  10. /var/local/ldb-toolchain/bin/../usr/include/pthread.h:562: doris::Thread::supervise_thread(void*) @ 0x000000000ba51d3c in /data/soft/doris/doris-2.0.2/be/lib/doris_be
  11. start_thread @ 0x0000000000007ea5 in /usr/lib64/libpthread-2.17.so
  12. __clone @ 0x00000000000feb0d in /usr/lib64/libc-2.17.so`
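
To see how many tablets are affected, the BE warnings can be summarized with a small script. The following is only a minimal sketch: the function name `parse_migrate_failure` is hypothetical, and the regular expressions assume the exact line layout shown in the BE log above, which may differ in other Doris versions.

```python
import re

# Assumption: the warning has the same "|"-separated layout as the BE log line above.
FAILURE_RE = re.compile(
    r"failed to migrate storage medium\|signature=(\d+)\|tablet_id=(\d+)\|error=\[(\w+)\]"
)
ROWSET_RE = re.compile(r"rowset_id=([0-9a-fA-F]+)")


def parse_migrate_failure(line):
    """Return signature, tablet_id, error code and rowset_id for one warning line, or None."""
    match = FAILURE_RE.search(line)
    if match is None:
        return None
    rowset = ROWSET_RE.search(line)
    return {
        "signature": match.group(1),
        "tablet_id": match.group(2),
        "error_code": match.group(3),
        # rowset_id is optional; not every failure message may carry one
        "rowset_id": rowset.group(1) if rowset else None,
    }


if __name__ == "__main__":
    import sys

    # Usage: python this_script.py < be.WARNING
    tablets = set()
    for log_line in sys.stdin:
        record = parse_migrate_failure(log_line)
        if record is not None:
            tablets.add(record["tablet_id"])
    print(len(tablets), "tablets failed to migrate")
```

Collecting the distinct `tablet_id` values makes it easier to check whether the failures are limited to a few partitions or spread across the whole table.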

fe.conf :

default_storage_medium = SSD
enable_storage_policy = true

be.conf:

storage_root_path = /grid/0/dorish,medium:SSD,2500;/grid/1/dorish,medium:SSD,2500;/grid/2/dorish,medium:SSD,2500;/grid/3/dorish,medium:SSD,2500;/grid/4/dorish,medium:SSD,2500;/grid/5/dorish,medium:SSD,2500;/grid/6/dorish,medium:SSD,2500;/grid/7/dorish,medium:SSD,2500;/grid/8/dorish,medium:SSD,2500
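
As a sanity check that every data directory really declares the same medium, the `storage_root_path` value can be split programmatically. This is a rough sketch, not Doris code: the helper `parse_storage_root_path` is hypothetical, and it assumes only the `path,medium:XXX,extra` layout used in the value above, keeping any other comma-separated fields as uninterpreted strings.

```python
def parse_storage_root_path(value):
    """Split a storage_root_path value into (path, medium, extras) tuples."""
    entries = []
    for raw in value.split(";"):
        raw = raw.strip()
        if not raw:
            continue  # tolerate a trailing ";"
        parts = [p.strip() for p in raw.split(",")]
        path, medium, extras = parts[0], None, []
        for part in parts[1:]:
            if part.lower().startswith("medium:"):
                medium = part.split(":", 1)[1].upper()
            else:
                # Assumption: anything else (e.g. "2500") is kept as-is, not interpreted
                extras.append(part)
        entries.append((path, medium, extras))
    return entries


if __name__ == "__main__":
    value = "/grid/0/dorish,medium:SSD,2500;/grid/1/dorish,medium:SSD,2500"
    entries = parse_storage_root_path(value)
    # Print the distinct media declared across all directories
    print(sorted({medium for _, medium, _ in entries if medium}))
```

If the set of media contains a single value, there is no other local medium configured on this BE for a tablet to migrate to.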

table property:

PROPERTIES (
"replication_allocation" = "tag.location.default: 3",
"is_being_synced" = "false",
"dynamic_partition.enable" = "true",
"dynamic_partition.time_unit" = "DAY",
"dynamic_partition.time_zone" = "Asia/Shanghai",
"dynamic_partition.start" = "-365",
"dynamic_partition.end" = "7",
"dynamic_partition.prefix" = "p",
"dynamic_partition.replication_allocation" = "tag.location.default: 3",
"dynamic_partition.buckets" = "8",
"dynamic_partition.create_history_partition" = "true",
"dynamic_partition.history_partition_num" = "365",
"dynamic_partition.hot_partition_num" = "0",
"dynamic_partition.reserved_history_periods" = "NULL",
"dynamic_partition.storage_policy" = "",
"dynamic_partition.storage_medium" = "SSD",
"storage_format" = "V2",
"light_schema_change" = "true",
"disable_auto_compaction" = "false",
"enable_single_replica_compaction" = "false"
);

What You Expected?

What is the STORAGE_MEDIUM_MIGRATE task, and why does this error happen?

How to Reproduce?

No response

Anything Else?

No response

lordk911 commented 10 months ago

@yiguolei @Tanya-W do you know why this happens?

353355756 commented 4 months ago

@lordk911 I encountered the same problem. Has it been resolved?