I ran tuning on an ONNX file using python and tvmc API, but after reaching half of the tasks it throws an error which stops the tuning and makes you start from the beginning (happened multiple times)
# Perform actual tuning with selected tasks
tvmc.tune(
model,
target=target,
tuning_records=tuning_records,
enable_autoscheduler=args.enable_autoscheduler,
trials=args.tuning_trials,
early_stopping=args.early_stopping,
timeout=20,
)
print("Tuning completed.")
terminate called after throwing an instance of 'tvm::runtime::InternalError'
what(): [13:54:11] /home/ubuntu/tvm/src/runtime/cuda/cuda_device_api.cc:312: InternalError: Check failed: (e == cudaSuccess || e == cudaErrorCudartUnloading) is false: CUDA: misaligned address
Stack trace:
0: tvm::runtime::CUDATimerNode::~CUDATimerNode()
at /home/ubuntu/tvm/src/runtime/cuda/cuda_device_api.cc:312
1: tvm::runtime::SimpleObjAllocator::Handler<tvm::runtime::CUDATimerNode>::Deleter_(tvm::runtime::Object*)
at /home/ubuntu/tvm/include/tvm/runtime/memory.h:138
2: tvm::runtime::ObjectPtr<tvm::runtime::Object>::reset()
at /home/ubuntu/tvm/include/tvm/runtime/object.h:455
3: tvm::runtime::ObjectPtr<tvm::runtime::Object>::~ObjectPtr()
at /home/ubuntu/tvm/include/tvm/runtime/object.h:404
4: tvm::runtime::ObjectRef::~ObjectRef()
at /home/ubuntu/tvm/include/tvm/runtime/object.h:519
5: tvm::runtime::Timer::~Timer()
at /home/ubuntu/tvm/include/tvm/runtime/profiling.h:86
6: operator()
at /home/ubuntu/tvm/src/runtime/profiling.cc:915
7: tvm::runtime::LocalSession::CallFunc(void*, TVMValue const*, int const*, int, std::function<void (tvm::runtime::TVMArgs)> const&)
at /home/ubuntu/tvm/src/runtime/rpc/rpc_local_session.cc:107
8: tvm::runtime::RPCSession::AsyncCallFunc(void*, TVMValue const*, int const*, int, std::function<void (tvm::runtime::RPCCode, tvm::runtime::TVMArgs)>)
at /home/ubuntu/tvm/src/runtime/rpc/rpc_session.cc:47
9: tvm::runtime::RPCEndpoint::EventHandler::HandleNormalCallFunc()
at /home/ubuntu/tvm/src/runtime/rpc/rpc_endpoint.cc:542
10: tvm::runtime::RPCEndpoint::EventHandler::HandleProcessPacket(std::function<void (tvm::runtime::TVMArgs)>)
at /home/ubuntu/tvm/src/runtime/rpc/rpc_endpoint.cc:362
11: tvm::runtime::RPCEndpoint::EventHandler::HandleNextEvent(bool, bool, std::function<void (tvm::runtime::TVMArgs)>)
at /home/ubuntu/tvm/src/runtime/rpc/rpc_endpoint.cc:136
12: tvm::runtime::RPCEndpoint::HandleUntilReturnEvent(bool, std::function<void (tvm::runtime::TVMArgs)>)
at /home/ubuntu/tvm/src/runtime/rpc/rpc_endpoint.cc:714
13: tvm::runtime::RPCEndpoint::ServerLoop()
at /home/ubuntu/tvm/src/runtime/rpc/rpc_endpoint.cc:805
14: tvm::runtime::RPCServerLoop(int)
at /home/ubuntu/tvm/src/runtime/rpc/rpc_socket_impl.cc:119
15: operator()
at /home/ubuntu/tvm/src/runtime/rpc/rpc_socket_impl.cc:138
Exception in thread Thread-1 (_listen_loop):
Traceback (most recent call last):
File "/home/ubuntu/miniconda3/envs/tvm-build-venv/lib/python3.11/threading.py", line 1045, in _bootstrap_inner
self.run()
File "/home/ubuntu/miniconda3/envs/tvm-build-venv/lib/python3.11/threading.py", line 982, in run
self._target(*self._args, **self._kwargs)
File "/home/ubuntu/tvm/python/tvm/rpc/server.py", line 279, in _listen_loop
_serving(conn, addr, opts, load_library)
File "/home/ubuntu/tvm/python/tvm/rpc/server.py", line 168, in _serving
raise RuntimeError(
RuntimeError: Child process 49293 exited unsuccessfully with error code -6
I ran tuning on an ONNX file using python and
tvmc
API, but after reaching half of the tasks it throws an error which stops the tuning and makes you start from the beginning (happened multiple times)Expected behavior
To complete all tasks for tuning
Actual behavior
We get an error:
Environment