WongKinYiu / CrossStagePartialNetworks

Cross Stage Partial Networks
https://github.com/WongKinYiu/CrossStagePartialNetworks

How to use a cfg? #4

Open ekarabulut opened 4 years ago

ekarabulut commented 4 years ago

Hi,

I've been using Darknet (AlexeyAB's fork) for a while, and I am familiar with how to configure a cfg file for custom object-detection training. This repository was suggested to me, and the claimed results seem promising.

However, I could not find how to use a cfg file for training. For instance, I could not figure out how to adapt the cfg to the number of classes I'd like to train for. In regular Darknet, one has to configure some parameters around the [yolo] layers (number of filters, classes, etc.) in order to adapt the cfg to a custom training run.

Any help is appreciated.

AlexeyAB commented 4 years ago

@ekarabulut Hi,

Do the same thing: https://github.com/AlexeyAB/darknet#how-to-train-to-detect-your-custom-objects

ekarabulut commented 4 years ago

Hi

Thank you for the reply. In Darknet, almost all configuration related to the number of classes happens in the [yolo] layers and in the convolutional layers just before each [yolo] layer. In, for example, the CSPDarknet53 or CSPDarknet53-Elastic cfg files, how can I set the number of classes?

AlexeyAB commented 4 years ago

> CSPDarknet53 or CSPDarknet53-Elastic

These models are classifiers for the ImageNet dataset.


Look at the detectors for the MS COCO dataset: https://github.com/WongKinYiu/CrossStagePartialNetworks#ms-coco

Or these models: https://github.com/WongKinYiu/CrossStagePartialNetworks/tree/master/in%20progress

WongKinYiu commented 4 years ago

Sorry, wrong click.

Rajasekhar06 commented 4 years ago

Using the instructions for custom object detection with YOLOv3 throws an error like this:

```
./darknet detector train data/yolov3-spp-3cls.data cfg/csresnext50-panet-spp.cfg -gpus 0,1,2,3
```

```
0,1,2,3 csresnext50-panet-spp
compute_capability = 750, cudnn_half = 1
net.optimized_memory = 0
batch = 4, time_steps = 1, train = 1
layer  filters  size/strd(dil)     input                output
  0 conv    64      7 x 7/ 2    608 x 608 x   3 -> 304 x 304 x  64 1.739 BF
  1 max             2x 2/ 2     304 x 304 x  64 -> 152 x 152 x  64 0.006 BF
  2 conv   128      1 x 1/ 1    152 x 152 x  64 -> 152 x 152 x 128 0.379 BF
  3 route  1                                    -> 152 x 152 x  64
  4 conv    64      1 x 1/ 1    152 x 152 x  64 -> 152 x 152 x  64 0.189 BF
  5 conv   128      1 x 1/ 1    152 x 152 x  64 -> 152 x 152 x 128 0.379 BF
  6 conv   128/ 32  3 x 3/ 1    152 x 152 x 128 -> 152 x 152 x 128 0.213 BF
  7 conv   128      1 x 1/ 1    152 x 152 x 128 -> 152 x 152 x 128 0.757 BF
  8 Shortcut Layer: 4  w = 152, w2 = 152, h = 152, h2 = 152, c = 128, c2 = 64
  9 conv   128      1 x 1/ 1    152 x 152 x 128 -> 152 x 152 x 128 0.757 BF
 10 conv   128/ 32  3 x 3/ 1    152 x 152 x 128 -> 152 x 152 x 128 0.213 BF
 11 conv   128      1 x 1/ 1    152 x 152 x 128 -> 152 x 152 x 128 0.757 BF
 12 Shortcut Layer: 8
 13 conv   128      1 x 1/ 1    152 x 152 x 128 -> 152 x 152 x 128 0.757 BF
 14 conv   128/ 32  3 x 3/ 1    152 x 152 x 128 -> 152 x 152 x 128 0.213 BF
 15 conv   128      1 x 1/ 1    152 x 152 x 128 -> 152 x 152 x 128 0.757 BF
 16 Shortcut Layer: 12
 17 conv   128      1 x 1/ 1    152 x 152 x 128 -> 152 x 152 x 128 0.757 BF
 18 route  17 2                                 -> 152 x 152 x 256
 19 conv   256      1 x 1/ 1    152 x 152 x 256 -> 152 x 152 x 256 3.028 BF
 20 conv   256/ 32  3 x 3/ 2    152 x 152 x 256 ->  76 x  76 x 256 0.213 BF
 21 conv   256      1 x 1/ 1     76 x  76 x 256 ->  76 x  76 x 256 0.757 BF
 22 route  20                                   ->  76 x  76 x 256
 23 conv   256      1 x 1/ 1     76 x  76 x 256 ->  76 x  76 x 256 0.757 BF
 24 conv   256      1 x 1/ 1     76 x  76 x 256 ->  76 x  76 x 256 0.757 BF
 25 conv   256/ 32  3 x 3/ 1     76 x  76 x 256 ->  76 x  76 x 256 0.213 BF
 26 conv   256      1 x 1/ 1     76 x  76 x 256 ->  76 x  76 x 256 0.757 BF
 27 Shortcut Layer: 23
 28 conv   256      1 x 1/ 1     76 x  76 x 256 ->  76 x  76 x 256 0.757 BF
 29 conv   256/ 32  3 x 3/ 1     76 x  76 x 256 ->  76 x  76 x 256 0.213 BF
 30 conv   256      1 x 1/ 1     76 x  76 x 256 ->  76 x  76 x 256 0.757 BF
 31 Shortcut Layer: 27
 32 conv   256      1 x 1/ 1     76 x  76 x 256 ->  76 x  76 x 256 0.757 BF
 33 conv   256/ 32  3 x 3/ 1     76 x  76 x 256 ->  76 x  76 x 256 0.213 BF
 34 conv   256      1 x 1/ 1     76 x  76 x 256 ->  76 x  76 x 256 0.757 BF
 35 Shortcut Layer: 31
 36 conv   256      1 x 1/ 1     76 x  76 x 256 ->  76 x  76 x 256 0.757 BF
 37 route  36 21                                ->  76 x  76 x 512
 38 conv   512      1 x 1/ 1     76 x  76 x 512 ->  76 x  76 x 512 3.028 BF
 39 conv   512/ 32  3 x 3/ 2     76 x  76 x 512 ->  38 x  38 x 512 0.213 BF
 40 conv   512      1 x 1/ 1     38 x  38 x 512 ->  38 x  38 x 512 0.757 BF
 41 route  39                                   ->  38 x  38 x 512
 42 conv   512      1 x 1/ 1     38 x  38 x 512 ->  38 x  38 x 512 0.757 BF
 43 conv   512      1 x 1/ 1     38 x  38 x 512 ->  38 x  38 x 512 0.757 BF
 44 conv   512/ 32  3 x 3/ 1     38 x  38 x 512 ->  38 x  38 x 512 0.213 BF
 45 conv   512      1 x 1/ 1     38 x  38 x 512 ->  38 x  38 x 512 0.757 BF
 46 Shortcut Layer: 42
 47 conv   512      1 x 1/ 1     38 x  38 x 512 ->  38 x  38 x 512 0.757 BF
 48 conv   512/ 32  3 x 3/ 1     38 x  38 x 512 ->  38 x  38 x 512 0.213 BF
 49 conv   512      1 x 1/ 1     38 x  38 x 512 ->  38 x  38 x 512 0.757 BF
 50 Shortcut Layer: 46
 51 conv   512      1 x 1/ 1     38 x  38 x 512 ->  38 x  38 x 512 0.757 BF
 52 conv   512/ 32  3 x 3/ 1     38 x  38 x 512 ->  38 x  38 x 512 0.213 BF
 53 conv   512      1 x 1/ 1     38 x  38 x 512 ->  38 x  38 x 512 0.757 BF
 54 Shortcut Layer: 50
 55 conv   512      1 x 1/ 1     38 x  38 x 512 ->  38 x  38 x 512 0.757 BF
 56 conv   512/ 32  3 x 3/ 1     38 x  38 x 512 ->  38 x  38 x 512 0.213 BF
 57 conv   512      1 x 1/ 1     38 x  38 x 512 ->  38 x  38 x 512 0.757 BF
 58 Shortcut Layer: 54
 59 conv   512      1 x 1/ 1     38 x  38 x 512 ->  38 x  38 x 512 0.757 BF
 60 conv   512/ 32  3 x 3/ 1     38 x  38 x 512 ->  38 x  38 x 512 0.213 BF
 61 conv   512      1 x 1/ 1     38 x  38 x 512 ->  38 x  38 x 512 0.757 BF
 62 Shortcut Layer: 58
 63 conv   512      1 x 1/ 1     38 x  38 x 512 ->  38 x  38 x 512 0.757 BF
 64 route  63 40                                ->  38 x  38 x1024
 65 conv  1024      1 x 1/ 1     38 x  38 x1024 ->  38 x  38 x1024 3.028 BF
 66 conv  1024/ 32  3 x 3/ 2     38 x  38 x1024 ->  19 x  19 x1024 0.213 BF
 67 conv  1024      1 x 1/ 1     19 x  19 x1024 ->  19 x  19 x1024 0.757 BF
 68 route  66                                   ->  19 x  19 x1024
 69 conv  1024      1 x 1/ 1     19 x  19 x1024 ->  19 x  19 x1024 0.757 BF
 70 conv  1024      1 x 1/ 1     19 x  19 x1024 ->  19 x  19 x1024 0.757 BF
 71 conv  1024/ 32  3 x 3/ 1     19 x  19 x1024 ->  19 x  19 x1024 0.213 BF
 72 conv  1024      1 x 1/ 1     19 x  19 x1024 ->  19 x  19 x1024 0.757 BF
 73 Shortcut Layer: 69
 74 conv  1024      1 x 1/ 1     19 x  19 x1024 ->  19 x  19 x1024 0.757 BF
 75 conv  1024/ 32  3 x 3/ 1     19 x  19 x1024 ->  19 x  19 x1024 0.213 BF
 76 conv  1024      1 x 1/ 1     19 x  19 x1024 ->  19 x  19 x1024 0.757 BF
 77 Shortcut Layer: 73
 78 conv  1024      1 x 1/ 1     19 x  19 x1024 ->  19 x  19 x1024 0.757 BF
 79 route  78 67                                ->  19 x  19 x2048
 80 conv  2048      1 x 1/ 1     19 x  19 x2048 ->  19 x  19 x2048 3.028 BF
 81 conv   512      1 x 1/ 1     19 x  19 x2048 ->  19 x  19 x 512 0.757 BF
 82 conv  1024      3 x 3/ 1     19 x  19 x 512 ->  19 x  19 x1024 3.407 BF
 83 conv   512      1 x 1/ 1     19 x  19 x1024 ->  19 x  19 x 512 0.379 BF
 84 max             5x 5/ 1      19 x  19 x 512 ->  19 x  19 x 512 0.005 BF
 85 route  83                                   ->  19 x  19 x 512
 86 max             9x 9/ 1      19 x  19 x 512 ->  19 x  19 x 512 0.015 BF
 87 route  83                                   ->  19 x  19 x 512
 88 max            13x13/ 1      19 x  19 x 512 ->  19 x  19 x 512 0.031 BF
 89 route  88 86 84 83                          ->  19 x  19 x2048
 90 conv   512      1 x 1/ 1     19 x  19 x2048 ->  19 x  19 x 512 0.757 BF
 91 conv  1024      3 x 3/ 1     19 x  19 x 512 ->  19 x  19 x1024 3.407 BF
 92 conv   512      1 x 1/ 1     19 x  19 x1024 ->  19 x  19 x 512 0.379 BF
 93 conv   256      1 x 1/ 1     19 x  19 x 512 ->  19 x  19 x 256 0.095 BF
 94 upsample            2x       19 x  19 x 256 ->  38 x  38 x 256
 95 route  65                                   ->  38 x  38 x1024
 96 conv   256      1 x 1/ 1     38 x  38 x1024 ->  38 x  38 x 256 0.757 BF
 97 route  96 94                                ->  38 x  38 x 512
 98 conv   256      1 x 1/ 1     38 x  38 x 512 ->  38 x  38 x 256 0.379 BF
 99 conv   512      3 x 3/ 1     38 x  38 x 256 ->  38 x  38 x 512 3.407 BF
100 conv   256      1 x 1/ 1     38 x  38 x 512 ->  38 x  38 x 256 0.379 BF
101 conv   512      3 x 3/ 1     38 x  38 x 256 ->  38 x  38 x 512 3.407 BF
102 conv   256      1 x 1/ 1     38 x  38 x 512 ->  38 x  38 x 256 0.379 BF
103 conv   128      1 x 1/ 1     38 x  38 x 256 ->  38 x  38 x 128 0.095 BF
104 upsample            2x       38 x  38 x 128 ->  76 x  76 x 128
105 route  38                                   ->  76 x  76 x 512
106 conv   128      1 x 1/ 1     76 x  76 x 512 ->  76 x  76 x 128 0.757 BF
107 route  106 104                              ->  76 x  76 x 256
108 conv   128      1 x 1/ 1     76 x  76 x 256 ->  76 x  76 x 128 0.379 BF
109 conv   256      3 x 3/ 1     76 x  76 x 128 ->  76 x  76 x 256 3.407 BF
110 conv   128      1 x 1/ 1     76 x  76 x 256 ->  76 x  76 x 128 0.379 BF
111 conv   256      3 x 3/ 1     76 x  76 x 128 ->  76 x  76 x 256 3.407 BF
112 conv   128      1 x 1/ 1     76 x  76 x 256 ->  76 x  76 x 128 0.379 BF
113 conv   256      3 x 3/ 1     76 x  76 x 128 ->  76 x  76 x 256 3.407 BF
114 conv    21      1 x 1/ 1     76 x  76 x 256 ->  76 x  76 x  21 0.062 BF
115 yolo

Error: l.outputs == params.inputs filters= in the [convolutional]-layer doesn't correspond to classes= or mask= in [yolo]-layer
```

How do I solve this? How many filters should be set for the new network? I made the changes as in the older method (filters, classes, and steps), but it seems to be different for this one.

WongKinYiu commented 4 years ago

@Rajasekhar06

Please follow https://github.com/AlexeyAB/darknet#how-to-train-to-detect-your-custom-objects

According to your .data file name "yolov3-spp-3cls.data", I assume there are 3 classes in your dataset, so the number of filters in the convolutional layers just before the [yolo] layers should be (classes + 5) × 3 = (3 + 5) × 3 = 24; however, it is set to 21 in your cfg file.
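The filter rule above can be sketched in code. This is a minimal illustration, assuming the usual 3 anchor masks per [yolo] layer; the `patch_cfg` helper is hypothetical and not part of darknet:

```python
def yolo_filters(num_classes: int, num_masks: int = 3) -> int:
    """Darknet rule: the [convolutional] layer just before a [yolo]
    layer needs (classes + 5) * masks filters."""
    return (num_classes + 5) * num_masks


def patch_cfg(cfg_text: str, num_classes: int) -> str:
    """Naive line-based sketch (hypothetical helper): set classes= in every
    [yolo] section and filters= in the [convolutional] section before it."""
    lines = cfg_text.splitlines()
    for i, line in enumerate(lines):
        if line.strip() == "[yolo]":
            # walk back to the nearest preceding filters= line
            for j in range(i - 1, -1, -1):
                if lines[j].startswith("filters="):
                    lines[j] = f"filters={yolo_filters(num_classes)}"
                    break
        elif line.strip().startswith("classes="):
            lines[i] = f"classes={num_classes}"
    return "\n".join(lines)


# 3 classes -> (3 + 5) * 3 = 24 filters
assert yolo_filters(3) == 24
```

A real cfg repeats this pattern once per detection head, so each [yolo] block (and its preceding [convolutional] block) must be updated.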

Rajasekhar06 commented 4 years ago

> @Rajasekhar06
>
> Please follow https://github.com/AlexeyAB/darknet#how-to-train-to-detect-your-custom-objects
>
> According to your .data file name "yolov3-spp-3cls.data", I assume there are 3 classes in your dataset, so the number of filters in the convolutional layers just before the [yolo] layers should be (classes + 5) × 3 = (3 + 5) × 3 = 24; however, it is set to 21 in your cfg file.

Thanks so much! I have made this change many times, but this time I forgot to do it. It's working now.

Rajasekhar06 commented 4 years ago

@WongKinYiu I have another small query: how do I set max_batches for multi-GPU training? For a single GPU we multiply the number of classes by 2000, but the README.md is a little confusing about steps and max_batches.

WongKinYiu commented 4 years ago

@Rajasekhar06

Because it depends on your dataset size and task difficulty. Usually, we set it near 300 × num_of_images / batch_size.
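The two rules of thumb in this thread can be written down side by side. A small sketch; the 6000-iteration floor in the first rule follows the darknet README's advice and is an assumption here:

```python
def max_batches_by_classes(num_classes: int) -> int:
    """Single-GPU rule of thumb: 2000 iterations per class,
    with a floor of 6000 (assumption, per the darknet README)."""
    return max(2000 * num_classes, 6000)


def max_batches_by_dataset(num_images: int, batch_size: int) -> int:
    """Heuristic suggested above: roughly 300 epochs' worth of batches."""
    return 300 * num_images // batch_size


# 3 classes -> 6000 iterations; 2000 images at batch size 64 -> 9375 iterations
assert max_batches_by_classes(3) == 6000
assert max_batches_by_dataset(2000, 64) == 9375
```

The steps= values (80% and 90% of max_batches) can then be derived from whichever estimate you pick.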

Rajasekhar06 commented 4 years ago

@WongKinYiu I had completed training by setting max_batches the old way (max_batches = 2000 × number of classes, which suits single-GPU training) with the yolov3-spp config; after 6000 iterations the avg. loss was 0.7xx. This time I used multiple GPUs (4 × TITAN RTX, 24 GB each) and your CSPNet config, and the avg. loss was still above 1.xxx after 6000 iterations on the same 3-class dataset, with burn_in = 4 × 1000 = 4000 (burn_in might be the reason the avg. loss is not decreasing, since I scaled that parameter for multi-GPU but not the iteration counts). So I restarted training from the last checkpoint with `2000 × num. of GPUs × num. of classes` and the respective step sizes (80% and 90%); results are yet to come, and I will update you once training completes. If that does not work, I will set the parameters you suggested, `300 × num_of_images / batch_size`, and train. (BTW, for multi-GPU, what about burn_in? Currently I set it to `num. of GPUs × 1000`, which blocks the use of tensor cores until half of max_batches.)

Thanks for your time and effort :100:
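The multi-GPU adjustment being discussed can be sketched as follows. It assumes the convention from the AlexeyAB darknet README for multi-GPU training: divide learning_rate by the GPU count and multiply burn_in by it.

```python
def scale_for_multi_gpu(learning_rate: float, burn_in: int, num_gpus: int):
    """Sketch of the multi-GPU cfg adjustment (assumed convention):
    keep the effective warm-up schedule roughly constant by dividing
    learning_rate by the number of GPUs and stretching burn_in by the
    same factor."""
    return learning_rate / num_gpus, burn_in * num_gpus


# 4 GPUs: learning_rate 0.001 -> 0.00025, burn_in 1000 -> 4000
lr, burn_in = scale_for_multi_gpu(0.001, 1000, 4)
assert abs(lr - 0.00025) < 1e-12 and burn_in == 4000
```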

shashwat1998 commented 3 years ago

@WongKinYiu Can these cfg files be used in PyTorch models?

WongKinYiu commented 3 years ago

yes.

shashwat1998 commented 3 years ago

> yes.

Can you please give some details on how to do it too?

WongKinYiu commented 3 years ago

Some examples are provided here.

shashwat1998 commented 3 years ago

> Some examples are provided here.

Thank You!