vincentgong7 / VG_AlexeyAB_darknet

A forked AlexeyAB Darknet repo with extra convenient functions.
https://darknet.gong.im
Other
84 stars 41 forks source link

Segmentation fault core dump when detecting 2 classes #27

Open Timbimjim opened 3 years ago

Timbimjim commented 3 years ago

Hey Vincent,

Thx for the great fork! I have a dataset train for 22 different classes. I used your fork and it went super smooth through 600 test images, saved the predictions etc pp.

Then I changed the classes for my dataset to only two classes (same images though), trained again and tried your fork for batch predictions.

Now after it runs through 4-5 images I keep getting the segmention fault core dump error.

I really don't know what the problem is. I did everything the same, only differenz is the new weight file and the fact it's only 2 classes now.

My training went without problems. I also didn't change anything important in the training (ofc I changed classes, filters etc. Though)

I Google a lot, tried out many different things but nothing works.

Any idea what might caus this? Seems like my weight file might be corrupted or so? It successfully loads it though.

Thx for any help

vincentgong7 commented 3 years ago

Hi Timbimjim,

Nice to know it helps in your project. Could you try your weight file in the original version of Yolo which my version based on? If it also corrupt, it might because of your weight file.

To the general questions about wight file trained by each user themselves, I'll take a time to check thoroughly. But recently I'm very busy.

Regards,

Vincent X. Gong

On Thu, Jan 14, 2021 at 4:45 AM Timbimjim notifications@github.com wrote:

Hey Vincent,

Thx for the great fork! I have a dataset train for 22 different classes. I used your fork and it went super smooth through 600 test images, saved the predictions etc pp.

Then I changed the classes for my dataset to only two classes (same images though), trained again and tried your fork for batch predictions.

Now after it runs through 4-5 images I keep getting the segmention fault core dump error.

I really don't know what the problem is. I did everything the same, only differenz is the new weight file and the fact it's only 2 classes now.

My training went without problems. I also didn't change anything important in the training (ofc I changed classes, filters etc. Though)

I Google a lot, tried out many different things but nothing works.

Any idea what might caus this? Seems like my weight file might be corrupted or so? It successfully loads it though.

Thx for any help

— You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHub https://github.com/vincentgong7/VG_AlexeyAB_darknet/issues/27, or unsubscribe https://github.com/notifications/unsubscribe-auth/ABSXEI2GKCP7KCXKO7IEPGDSZZSHBANCNFSM4WBZVWIQ .

Timbimjim commented 3 years ago

Hey Vincent,

The Problem was resolved. I changed random = 0 and max batch to 6000, it was 4000 before, and trained again. I was using roboflow Google colab where it automatically generates the cfg for you by multiplying the classes with 2000 for maxbatch. But according to Alexey this has to be at least 6000 or bigger.

So was either this or the random = 1,which overloaded the gpu I guess.

Works now fine. Thx for your reply anyway

Have a good day mate

vincentgong7 commented 3 years ago

Nice to know that! Thanks and have a good day.

BR, Vincent

On Sat, 16 Jan 2021 at 11:41, Timbimjim notifications@github.com wrote:

Hey Vincent,

The Problem was resolved. I changed random = 0 and max batch to 6000, it was 4000 before, and trained again. I was using roboflow Google colab where it automatically generates the cfg for you by multiplying the classes with 2000 for maxbatch. But according to Alexey this has to be at least 6000 or bigger.

So was either this or the random = 1,which overloaded the gpu I guess.

Works now fine. Thx for your reply anyway

Have a good day mate

— You are receiving this because you commented.

Reply to this email directly, view it on GitHub https://github.com/vincentgong7/VG_AlexeyAB_darknet/issues/27#issuecomment-761542085, or unsubscribe https://github.com/notifications/unsubscribe-auth/ABSXEI4EJOEJ7FCDEWTMDEDS2FUOHANCNFSM4WBZVWIQ .

--

Regards,

Vincent X. Gong