yikang-li / FactorizableNet

Factorizable Net (Multi-GPU version): An Efficient Subgraph-based Framework for Scene Graph Generation
216 stars 38 forks source link

Segmentation fault due to _C.so file #26

Closed sandeep-ipk closed 5 years ago

sandeep-ipk commented 5 years ago

Hey @yikang-li,

The _C.so file in lib/layer_utils is giving a segmentation fault in all the files it is being called. For example, the first fault comes in lib/ayer_utils/roi_layers/nms.py (nms = _C.nms) file.

In conclusion, wherever the _C.so file is imported is giving me a segmentation fault. Could you perhaps help me out?

These are my settings: CUDA = 10.1 python = 2.7.16 PyTorch = 1.0.1 gcc = 5.2.0

Thank you very much, Sandeep.

Here is the fault:

`Fatal Python error: Segmentation fault

Thread 0x00007fe32b9ec700 (most recent call first): File " /home/xyz/opt/python-2.7.16/lib/python2.7/threading.py", line 340 in wait File " /home/xyz/opt/python-2.7.16/lib/python2.7/multiprocessing/queues.py", line 252 in _feed File " /home/xyz/opt/python-2.7.16/lib/python2.7/threading.py", line 754 in run File " /home/xyz/opt/python-2.7.16/lib/python2.7/threading.py", line 801 in __bootstrap_inner File " /home/xyz/opt/python-2.7.16/lib/python2.7/threading.py", line 774 in __bootstrap

Thread 0x00007fe32c550700 (most recent call first): File " /home/xyz/opt/python-2.7.16/lib/python2.7/threading.py", line 340 in wait File " /home/xyz/opt/python-2.7.16/lib/python2.7/multiprocessing/queues.py", line 252 in _feed File " /home/xyz/opt/python-2.7.16/lib/python2.7/threading.py", line 754 in run File " /home/xyz/opt/python-2.7.16/lib/python2.7/threading.py", line 801 in __bootstrap_inner File " /home/xyz/opt/python-2.7.16/lib/python2.7/threading.py", line 774 in __bootstrap

Thread 0x00007fe32cd51700 (most recent call first): File " /home/xyz/opt/python-2.7.16/lib/python2.7/threading.py", line 340 in wait File " /home/xyz/opt/python-2.7.16/lib/python2.7/multiprocessing/queues.py", line 252 in _feed File " /home/xyz/opt/python-2.7.16/lib/python2.7/threading.py", line 754 in run File " /home/xyz/opt/python-2.7.16/lib/python2.7/threading.py", line 801 in __bootstrap_inner File " /home/xyz/opt/python-2.7.16/lib/python2.7/threading.py", line 774 in __bootstrap

Thread 0x00007fe3e0932700 (most recent call first): File " /home/xyz/opt/python-2.7.16/lib/python2.7/threading.py", line 340 in wait File " /home/xyz/opt/python-2.7.16/lib/python2.7/multiprocessing/queues.py", line 252 in _feed File " /home/xyz/opt/python-2.7.16/lib/python2.7/threading.py", line 754 in run File " /home/xyz/opt/python-2.7.16/lib/python2.7/threading.py", line 801 in __bootstrap_inner File " /home/xyz/opt/python-2.7.16/lib/python2.7/threading.py", line 774 in __bootstrap

Thread 0x00007fe3e0131700 (most recent call first): File " /home/xyz/opt/python-2.7.16/lib/python2.7/multiprocessing/queues.py", line 131 in get File " /raid/xyz/python-environments/ipk3/lib/python2.7/site-packages/torch/utils/data/dataloader.py", line 158 in _pin_memory_loop File " /home/xyz/opt/python-2.7.16/lib/python2.7/threading.py", line 754 in run File " /home/xyz/opt/python-2.7.16/lib/python2.7/threading.py", line 801 in __bootstrap_inner File " /home/xyz/opt/python-2.7.16/lib/python2.7/threading.py", line 774 in __bootstrap

Current thread 0x00007fe40d711740 (most recent call first): File " /raid/xyz/FactorizableNet/lib/fast_rcnn/nms_wrapper.py", line 26 in nms File " /raid/xyz/FactorizableNet/lib/rpn_msr/proposal_layer.py", line 134 in proposal_layer File " /raid/xyz/FactorizableNet/models/RPN/RPN.py", line 141 in proposal_layer File " /raid/xyz/FactorizableNet/models/RPN/RPN.py", line 115 in forward File " /raid/xyz/python-environments/ipk3/lib/python2.7/site-packages/torch/nn/modules/module.py", line 489 in call File " /raid/xyz/FactorizableNet/models/HDN_v2/factorizable_network_v4.py", line 143 in forward File " /raid/xyz/python-environments/ipk3/lib/python2.7/site-packages/torch/nn/modules/module.py", line 489 in call File " /raid/xyz/python-environments/ipk3/lib/python2.7/site-packages/torch/nn/parallel/parallel_apply.py", line 59 in _worker File " /raid/xyz/python-environments/ipk3/lib/python2.7/site-packages/torch/nn/parallel/parallel_apply.py", line 77 in parallel_apply File " /raid/xyz/python-environments/ipk3/lib/python2.7/site-packages/torch/nn/parallel/data_parallel.py", line 153 in parallel_apply File " /raid/xyz/FactorizableNet/models/modules/dataParallel.py", line 36 in forward File " /raid/xyz/python-environments/ipk3/lib/python2.7/site-packages/torch/nn/modules/module.py", line 489 in call File " /raid/xyz/FactorizableNet/models/HDN_v2/engines_v1.py", line 44 in train File "train_FN.py", line 336 in main File "train_FN.py", line 403 in Segmentation fault`

sandeep-ipk commented 5 years ago

Hey @yikang-li

I solved it by downloading Faster R-CNN PyTorch implementation. After that, I re-ran setup.py of its library, copied the _C.so files created from its compilation to the Factorizable Net's corresponding locations.

I can't think of an explanation of why this worked.

Best, Sandeep.

abhijeetnijampurkar commented 4 years ago

Hey @sandeep-ipk ! Can you precisely mention the steps I need to follow to get rid of this issue? Thanks in advance. :)