NVIDIA / DIGITS

Deep Learning GPU Training System
https://developer.nvidia.com/digits
BSD 3-Clause "New" or "Revised" License
4.12k stars 1.38k forks source link

Error code - 11 (NVIDIA Digits Kitti dataset training) #1957

Open srujandk opened 6 years ago

srujandk commented 6 years ago

I am getting the following error when I followed the tutorial for object detection on https://github.com/NVIDIA/DIGITS/blob/master/examples/object-detection/README.md Please suggest the changes to be made to get this working. Thank You. Log file:

I0124 11:38:38.674772 13649 net.cpp:409] bbox_loss -> loss_bbox I0124 11:38:38.674782 13649 net.cpp:144] Setting up bbox_loss I0124 11:38:38.674787 13649 net.cpp:151] Top shape: (1) I0124 11:38:38.674791 13649 net.cpp:154] with loss weight 2 I0124 11:38:38.674804 13649 net.cpp:159] Memory required for data: 3824634628 I0124 11:38:38.674808 13649 layer_factory.hpp:77] Creating layer coverage_loss I0124 11:38:38.674814 13649 net.cpp:94] Creating Layer coverage_loss I0124 11:38:38.674819 13649 net.cpp:435] coverage_loss <- coverage_coverage/sig_0_split_0 I0124 11:38:38.674824 13649 net.cpp:435] coverage_loss <- coverage-label_slice-label_4_split_0 I0124 11:38:38.674832 13649 net.cpp:409] coverage_loss -> loss_coverage I0124 11:38:38.674841 13649 net.cpp:144] Setting up coverage_loss I0124 11:38:38.674846 13649 net.cpp:151] Top shape: (1) I0124 11:38:38.674850 13649 net.cpp:154] with loss weight 1 I0124 11:38:38.674855 13649 net.cpp:159] Memory required for data: 3824634632 I0124 11:38:38.674860 13649 layer_factory.hpp:77] Creating layer cluster Aborted at 1516774119 (unix time) try "date -d @1516774119" if you are using GNU date PC: @ 0x7f7a3400c873 std::_Hashtable<>::clear() SIGSEGV (@0x9) received by PID 13649 (TID 0x7f7ab7b26ac0) from PID 9; stack trace: @ 0x7f7ab5c234b0 (unknown) @ 0x7f7a3400c873 std::_Hashtable<>::clear() @ 0x7f7a33ffe346 google::protobuf::DescriptorPool::FindFileByName() @ 0x7f7a33fdcac8 google::protobuf::python::cdescriptor_pool::AddSerializedFile() @ 0x7f7ab684e9e0 PyEval_EvalFrameEx @ 0x7f7ab698404c PyEval_EvalCodeEx @ 0x7f7ab68da45d (unknown) @ 0x7f7ab68ad263 PyObject_Call @ 0x7f7ab68cdb65 (unknown) @ 0x7f7ab6864163 (unknown) @ 0x7f7ab68ad263 PyObject_Call @ 0x7f7ab684b34c PyEval_EvalFrameEx @ 0x7f7ab698404c PyEval_EvalCodeEx @ 0x7f7ab6845d99 PyEval_EvalCode @ 0x7f7ab68e7234 PyImport_ExecCodeModuleEx @ 0x7f7ab68e7c0f (unknown) @ 0x7f7ab68e9380 (unknown) @ 0x7f7ab68e9648 (unknown) @ 0x7f7ab68ea75b PyImport_ImportModuleLevel @ 0x7f7ab68548a8 (unknown) @ 0x7f7ab68ad263 PyObject_Call @ 0x7f7ab6983477 PyEval_CallObjectWithKeywords @ 0x7f7ab68497d6 PyEval_EvalFrameEx @ 0x7f7ab698404c PyEval_EvalCodeEx @ 0x7f7ab6845d99 PyEval_EvalCode @ 0x7f7ab68e7234 PyImport_ExecCodeModuleEx @ 0x7f7ab68e7c0f (unknown) @ 0x7f7ab68e9380 (unknown) @ 0x7f7ab68e9648 (unknown) @ 0x7f7ab68ea75b PyImport_ImportModuleLevel @ 0x7f7ab68548a8 (unknown) @ 0x7f7ab68ad263 PyObject_Call

JadBatmobile commented 6 years ago

Found a Solution?