AITTSMD / MTCNN-Tensorflow

Reproduce MTCNN using Tensorflow

Process KILLED issue in gen_hard_example.py #277

Closed Zepyhrus closed 5 years ago

Zepyhrus commented 5 years ago

"""
time cost in average1.144 pnet 1.144 rnet 0.000 onet 0.000
boxes length: 12880
finish detecting
save_path is : ../images/no_LM12/RNet
Killed
"""

Hi mate, thanks for your brilliant open-source work. I hit this issue while generating training examples for RNet.

I'm using:

Any idea what causes this? Could it be related to OOM, and how can I solve it?

Zepyhrus commented 5 years ago

This is so frustrating: each time, the test part takes around 3 hours and then gets killed during the pickle.dump step. Is it because of the huge size of detections, which is generated by mtcnn_detector.detect_face(test_data)?
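A bare "Killed" with no Python traceback is consistent with the Linux out-of-memory killer, so one quick check is to log the process's peak memory just before the pickle.dump call. The helper below is a minimal sketch using only the standard library; `peak_rss_mb` is a hypothetical name and is not part of gen_hard_example.py.

```python
import resource
import sys


def peak_rss_mb():
    """Return the peak resident set size of this process in MiB (Unix only)."""
    peak = resource.getrusage(resource.RUSAGE_SELF).ru_maxrss
    # ru_maxrss is reported in kilobytes on Linux and in bytes on macOS.
    divisor = 1024 * 1024 if sys.platform == "darwin" else 1024
    return peak / divisor


if __name__ == "__main__":
    print("peak memory so far: {:.1f} MiB".format(peak_rss_mb()))
```

If the value printed right before the dump is close to the machine's total RAM, memory exhaustion is the likely cause.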

Zepyhrus commented 5 years ago

Problem solved. This is what the author is doing: load images from disk explicitly -> generate the boxes (at this point, all the boxes are cached in memory) -> dump the boxes to disk for the convenience of debugging -> restore the dumped pickle file later to save the hard example images.

It is so hard to believe that someone would save an intermediate variable to disk just for debugging...
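One way to keep an on-disk dump without holding every box in memory is to write one pickle record per image and read the records back lazily. This is only a sketch under assumptions, not the change applied in this thread: `detect_one` is a hypothetical per-image detector callable standing in for the repo's detection call, and both function names are made up for illustration.

```python
import pickle
from typing import Any, Callable, Iterable, Iterator, Tuple


def dump_detections_streaming(
    image_paths: Iterable[str],
    detect_one: Callable[[str], Any],  # hypothetical per-image detector
    out_path: str,
) -> int:
    """Append one (path, boxes) record per image; return the record count."""
    count = 0
    with open(out_path, "wb") as f:
        for path in image_paths:
            boxes = detect_one(path)
            # Each record is pickled on its own, so only the current
            # image's boxes need to be in memory at any time.
            pickle.dump((path, boxes), f, protocol=pickle.HIGHEST_PROTOCOL)
            count += 1
    return count


def load_detections_streaming(in_path: str) -> Iterator[Tuple[str, Any]]:
    """Yield the records back one at a time until the file is exhausted."""
    with open(in_path, "rb") as f:
        while True:
            try:
                yield pickle.load(f)
            except EOFError:
                return
```

The hard-example saving step could then iterate over `load_detections_streaming(...)` instead of loading a single huge list. Dropping the intermediate dump entirely and saving crops as each image is processed would avoid the file altogether.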

yaoyao14 commented 5 years ago

@Zepyhrus Hey, I'm sorry to bother you, but I'm facing the same problem and I don't know how to fix it. Can you please tell me what I should do to solve it? Thank you so much.

Zepyhrus commented 5 years ago

@yaoyao14 Hi, here is the thing, hope this still helps: