AlexeyAB / darknet

YOLOv4 / Scaled-YOLOv4 / YOLO - Neural Networks for Object Detection (Windows and Linux version of Darknet)
http://pjreddie.com/darknet/

Why can't a 416x416 cfg recognize large objects? #5497

Closed: kame-lqm closed this issue 4 years ago

kame-lqm commented 4 years ago

I set 'random=1' in the cfg file and trained on a custom dataset. When I then set 416x416 in the cfg file for detection, it can detect most of the small objects but misses some of the large ones. When I set 544x544 in the cfg file, most of the large objects are detected. That's quite weird; why? Does anyone know what is happening? And how can I detect the large objects at 416x416? Thanks in advance.

416x416 in cfg: 1480113_416x416

544x544 in cfg: 1480113_544x544
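(As the discussion below explains, the resolution a model performs best at tends to track the resolution it was trained at. For reference, in this repo the test-time network resolution is set by width/height in the [net] section of the cfg and must be a multiple of 32; no retraining is needed to change it. A minimal sketch, assuming detection at the higher resolution:)

    [net]
    # test-time network resolution; any multiple of 32
    width=544
    height=544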

AlexeyAB commented 4 years ago

https://github.com/AlexeyAB/darknet#how-to-improve-object-detection

for each object which you want to detect, there must be at least 1 similar object in the training dataset with about the same: shape, side of object, relative size, angle of rotation, tilt, illumination. So it is desirable that your training dataset includes images with objects at different scales, rotations, lightings, from different sides, on different backgrounds; you should preferably have 2000 different images for each class or more, and you should train for 2000*classes iterations or more

kame-lqm commented 4 years ago

https://github.com/AlexeyAB/darknet#how-to-improve-object-detection

for each object which you want to detect, there must be at least 1 similar object in the training dataset with about the same: shape, side of object, relative size, angle of rotation, tilt, illumination. So it is desirable that your training dataset includes images with objects at different scales, rotations, lightings, from different sides, on different backgrounds; you should preferably have 2000 different images for each class or more, and you should train for 2000*classes iterations or more

Thanks for your reply, but I still don't understand the root cause. Let me describe my problem in more detail:

I used the Visdrone2018 dataset and part of the WiderPerson dataset as my dataset, and there are more than 60 images similar to the two above in the training set. The dataset contains all kinds of cars and people. There are more than 18,000 images in the training set, with more than 10 cars per image on average. Although I set 'classes=80' in the cfg, there are only 11 classes in my dataset. I have already trained it for more than 60,000 iterations. So I guess the dataset is not the problem; maybe it is my cfg file. I attach my cfg file here; hopefully you can take a look. Thanks so much.

-----------------------------------------------------------------------------------------------------------

[net]
# Testing
# batch=1
# subdivisions=1
# Training
batch=64
subdivisions=16
width=544
height=544
channels=3
momentum=0.9
decay=0.0005
angle=0
saturation = 1.5
exposure = 1.5
hue=.1

learning_rate=0.001
burn_in=1000
max_batches = 60000
policy=steps
steps=10000,20000
scales=.1,.1

[convolutional] batch_normalize=1 filters=32 size=3 stride=1 pad=1 activation=leaky

# Downsample
[convolutional] batch_normalize=1 filters=64 size=3 stride=2 pad=1 activation=leaky

[convolutional] batch_normalize=1 filters=32 size=1 stride=1 pad=1 activation=leaky

[convolutional] batch_normalize=1 filters=64 size=3 stride=1 pad=1 activation=leaky

[shortcut] from=-3 activation=linear

# Downsample
[convolutional] batch_normalize=1 filters=128 size=3 stride=2 pad=1 activation=leaky

[convolutional] batch_normalize=1 filters=64 size=1 stride=1 pad=1 activation=leaky

[convolutional] batch_normalize=1 filters=128 size=3 stride=1 pad=1 activation=leaky

[shortcut] from=-3 activation=linear

[convolutional] batch_normalize=1 filters=64 size=1 stride=1 pad=1 activation=leaky

[convolutional] batch_normalize=1 filters=128 size=3 stride=1 pad=1 activation=leaky

[shortcut] from=-3 activation=linear

# Downsample
[convolutional] batch_normalize=1 filters=256 size=3 stride=2 pad=1 activation=leaky

[convolutional] batch_normalize=1 filters=128 size=1 stride=1 pad=1 activation=leaky

[convolutional] batch_normalize=1 filters=256 size=3 stride=1 pad=1 activation=leaky

[shortcut] from=-3 activation=linear

[convolutional] batch_normalize=1 filters=128 size=1 stride=1 pad=1 activation=leaky

[convolutional] batch_normalize=1 filters=256 size=3 stride=1 pad=1 activation=leaky

[shortcut] from=-3 activation=linear

[convolutional] batch_normalize=1 filters=128 size=1 stride=1 pad=1 activation=leaky

[convolutional] batch_normalize=1 filters=256 size=3 stride=1 pad=1 activation=leaky

[shortcut] from=-3 activation=linear

[convolutional] batch_normalize=1 filters=128 size=1 stride=1 pad=1 activation=leaky

[convolutional] batch_normalize=1 filters=256 size=3 stride=1 pad=1 activation=leaky

[shortcut] from=-3 activation=linear

[convolutional] batch_normalize=1 filters=128 size=1 stride=1 pad=1 activation=leaky

[convolutional] batch_normalize=1 filters=256 size=3 stride=1 pad=1 activation=leaky

[shortcut] from=-3 activation=linear

[convolutional] batch_normalize=1 filters=128 size=1 stride=1 pad=1 activation=leaky

[convolutional] batch_normalize=1 filters=256 size=3 stride=1 pad=1 activation=leaky

[shortcut] from=-3 activation=linear

[convolutional] batch_normalize=1 filters=128 size=1 stride=1 pad=1 activation=leaky

[convolutional] batch_normalize=1 filters=256 size=3 stride=1 pad=1 activation=leaky

[shortcut] from=-3 activation=linear

[convolutional] batch_normalize=1 filters=128 size=1 stride=1 pad=1 activation=leaky

[convolutional] batch_normalize=1 filters=256 size=3 stride=1 pad=1 activation=leaky

[shortcut] from=-3 activation=linear

# Downsample
[convolutional] batch_normalize=1 filters=512 size=3 stride=2 pad=1 activation=leaky

[convolutional] batch_normalize=1 filters=256 size=1 stride=1 pad=1 activation=leaky

[convolutional] batch_normalize=1 filters=512 size=3 stride=1 pad=1 activation=leaky

[shortcut] from=-3 activation=linear

[convolutional] batch_normalize=1 filters=256 size=1 stride=1 pad=1 activation=leaky

[convolutional] batch_normalize=1 filters=512 size=3 stride=1 pad=1 activation=leaky

[shortcut] from=-3 activation=linear

[convolutional] batch_normalize=1 filters=256 size=1 stride=1 pad=1 activation=leaky

[convolutional] batch_normalize=1 filters=512 size=3 stride=1 pad=1 activation=leaky

[shortcut] from=-3 activation=linear

[convolutional] batch_normalize=1 filters=256 size=1 stride=1 pad=1 activation=leaky

[convolutional] batch_normalize=1 filters=512 size=3 stride=1 pad=1 activation=leaky

[shortcut] from=-3 activation=linear

[convolutional] batch_normalize=1 filters=256 size=1 stride=1 pad=1 activation=leaky

[convolutional] batch_normalize=1 filters=512 size=3 stride=1 pad=1 activation=leaky

[shortcut] from=-3 activation=linear

[convolutional] batch_normalize=1 filters=256 size=1 stride=1 pad=1 activation=leaky

[convolutional] batch_normalize=1 filters=512 size=3 stride=1 pad=1 activation=leaky

[shortcut] from=-3 activation=linear

[convolutional] batch_normalize=1 filters=256 size=1 stride=1 pad=1 activation=leaky

[convolutional] batch_normalize=1 filters=512 size=3 stride=1 pad=1 activation=leaky

[shortcut] from=-3 activation=linear

[convolutional] batch_normalize=1 filters=256 size=1 stride=1 pad=1 activation=leaky

[convolutional] batch_normalize=1 filters=512 size=3 stride=1 pad=1 activation=leaky

[shortcut] from=-3 activation=linear

# Downsample
[convolutional] batch_normalize=1 filters=1024 size=3 stride=2 pad=1 activation=leaky

[convolutional] batch_normalize=1 filters=512 size=1 stride=1 pad=1 activation=leaky

[convolutional] batch_normalize=1 filters=1024 size=3 stride=1 pad=1 activation=leaky

[shortcut] from=-3 activation=linear

[convolutional] batch_normalize=1 filters=512 size=1 stride=1 pad=1 activation=leaky

[convolutional] batch_normalize=1 filters=1024 size=3 stride=1 pad=1 activation=leaky

[shortcut] from=-3 activation=linear

[convolutional] batch_normalize=1 filters=512 size=1 stride=1 pad=1 activation=leaky

[convolutional] batch_normalize=1 filters=1024 size=3 stride=1 pad=1 activation=leaky

[shortcut] from=-3 activation=linear

[convolutional] batch_normalize=1 filters=512 size=1 stride=1 pad=1 activation=leaky

[convolutional] batch_normalize=1 filters=1024 size=3 stride=1 pad=1 activation=leaky

[shortcut] from=-3 activation=linear

######################

[convolutional] batch_normalize=1 filters=512 size=1 stride=1 pad=1 activation=leaky

[convolutional] batch_normalize=1 size=3 stride=1 pad=1 filters=1024 activation=leaky

[convolutional] batch_normalize=1 filters=512 size=1 stride=1 pad=1 activation=leaky

[convolutional] batch_normalize=1 size=3 stride=1 pad=1 filters=1024 activation=leaky

[convolutional] batch_normalize=1 filters=512 size=1 stride=1 pad=1 activation=leaky

[convolutional] batch_normalize=1 size=3 stride=1 pad=1 filters=1024 activation=leaky

[convolutional] size=1 stride=1 pad=1 filters=255 activation=linear

[yolo] mask = 6,7,8 anchors = 10,13, 18,25, 33,23, 31,58, 62,45, 59,119, 116,90, 156,198, 313,263 classes=80 num=9 jitter=.3 ignore_thresh = .5 truth_thresh = 1 random=1

[route] layers = -4

[convolutional] batch_normalize=1 filters=256 size=1 stride=1 pad=1 activation=leaky

[upsample] stride=2

[route] layers = -1, 61

[convolutional] batch_normalize=1 filters=256 size=1 stride=1 pad=1 activation=leaky

[convolutional] batch_normalize=1 size=3 stride=1 pad=1 filters=512 activation=leaky

[convolutional] batch_normalize=1 filters=256 size=1 stride=1 pad=1 activation=leaky

[convolutional] batch_normalize=1 size=3 stride=1 pad=1 filters=512 activation=leaky

[convolutional] batch_normalize=1 filters=256 size=1 stride=1 pad=1 activation=leaky

[convolutional] batch_normalize=1 size=3 stride=1 pad=1 filters=512 activation=leaky

[convolutional] size=1 stride=1 pad=1 filters=255 activation=linear

[yolo] mask = 3,4,5 anchors = 10,13, 18,25, 33,23, 31,58, 62,45, 59,119, 116,90, 156,198, 313,263 classes=80 num=11 jitter=.3 ignore_thresh = .5 truth_thresh = 1 random=1

[route] layers = -4

[convolutional] batch_normalize=1 filters=128 size=1 stride=1 pad=1 activation=leaky

[upsample] stride=2

[route] layers = -1, 36

[convolutional] batch_normalize=1 filters=128 size=1 stride=1 pad=1 activation=leaky

[convolutional] batch_normalize=1 size=3 stride=1 pad=1 filters=256 activation=leaky

[convolutional] batch_normalize=1 filters=128 size=1 stride=1 pad=1 activation=leaky

[convolutional] batch_normalize=1 size=3 stride=1 pad=1 filters=256 activation=leaky

[convolutional] batch_normalize=1 filters=128 size=1 stride=1 pad=1 activation=leaky

[convolutional] batch_normalize=1 size=3 stride=1 pad=1 filters=256 activation=leaky

[convolutional] size=1 stride=1 pad=1 filters=255 activation=linear

[yolo] mask = 0,1,2 anchors = 10,13, 18,25, 33,23, 31,58, 62,45, 59,119, 116,90, 156,198, 313,263 classes=80 num=9 jitter=.3 ignore_thresh = .5 truth_thresh = 1 random=1
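For reference, the repo's README ties the filters= of the [convolutional] layer immediately before each [yolo] layer to the class count: filters = (classes + 5) * <number of masks in that layer>. A sketch of one detection head if the cfg were matched to the 11 classes actually present (these values are illustrative, not the ones in the posted cfg):

    [convolutional]
    size=1
    stride=1
    pad=1
    # (classes + 5) * 3 = (11 + 5) * 3 = 48
    filters=48
    activation=linear

    [yolo]
    mask = 6,7,8
    classes=11
    num=9
    # (other [yolo] parameters unchanged)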

AlexeyAB commented 4 years ago

Although I set 'classes=80' in the cfg, there are only 11 classes in my dataset.

Why?

Show chart.png with Loss and mAP

What mAP do you get?

What validation dataset did you use?

I used the Visdrone2018 dataset and part of the WiderPerson dataset as my dataset, and there are more than 60 images similar to the two above in the training set. The dataset contains all kinds of cars and people.

Are class_id's the same for the same objects in both datasets?
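For reference, the mAP asked about above can be measured on the validation set with the repo's map command; a sketch with placeholder data, cfg, and weights file names:

    ./darknet detector map data/obj.data cfg/yolov3-custom.cfg backup/yolov3-custom_last.weights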

kame-lqm commented 4 years ago

Although I set 'classes=80' in the cfg, there are only 11 classes in my dataset.

Why?

Show chart.png with Loss and mAP

What mAP do you get?

What validation dataset did you use?

I used the Visdrone2018 dataset and part of the WiderPerson dataset as my dataset, and there are more than 60 images similar to the two above in the training set. The dataset contains all kinds of cars and people.

Are class_id's the same for the same objects in both datasets?

They used different class IDs. In the Visdrone dataset, all people are given the class name "pedestrian", and in the WiderPerson dataset, all people are given the class name "people". The Visdrone dataset is collected from drone cameras, so it is quite different from the WiderPerson dataset, which is collected from street surveillance cameras.

Adding chart.png here (I forgot to add '-map', so there is no mAP in the chart): chart_yolov3_visdrone_ft0_60000
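For reference, the '-map' flag mentioned above is appended to the training command so that mAP is computed periodically and drawn on chart.png; a sketch with placeholder file names:

    ./darknet detector train data/obj.data cfg/yolov3-custom.cfg darknet53.conv.74 -map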

kame-lqm commented 4 years ago

Although I set 'classes=80' in the cfg, there are only 11 classes in my dataset.

Why? Show chart.png with Loss and mAP. What mAP do you get? What validation dataset did you use?

I used the Visdrone2018 dataset and part of the WiderPerson dataset as my dataset, and there are more than 60 images similar to the two above in the training set. The dataset contains all kinds of cars and people.

Are class_id's the same for the same objects in both datasets?

They used different class IDs. In the Visdrone dataset, all people are given the class name "pedestrian", and in the WiderPerson dataset, all people are given the class name "people". The Visdrone dataset is collected from drone cameras, so it is quite different from the WiderPerson dataset, which is collected from street surveillance cameras.

Adding chart.png here (I forgot to add '-map', so there is no mAP in the chart): chart_yolov3_visdrone_ft0_60000

Hi Alexey, I'm now training on the same datasets using a new cfg file, and below are the training results (training is not finished yet). 1. mAP at iteration 10347:

(next mAP calculation at 10347 iterations)
10347: 7.988983, 8.454061 avg loss, 0.000300 rate, 4.369828 seconds, 1986624 images, 76.417881 hours left
Resizing to initial size: 416 x 416
try to allocate additional workspace_size = 52.43 MB CUDA allocate done!
try to allocate additional workspace_size = 52.43 MB CUDA allocate done!
try to allocate additional workspace_size = 52.43 MB CUDA allocate done!
calculation mAP (mean average precision)...
4576
detections_count = 1148086, unique_truth_count = 106417
class_id = 0, name = pedestrian, ap = 10.24% (TP = 2580, FP = 5155)
class_id = 1, name = people, ap = 26.38% (TP = 3212, FP = 2791)  # class id from dataset 'WiderPerson'; the other class ids are all from dataset 'Visdrone2018'
class_id = 2, name = bicycle, ap = 2.19% (TP = 23, FP = 57)
class_id = 3, name = car, ap = 57.78% (TP = 27420, FP = 15531)
class_id = 4, name = van, ap = 28.83% (TP = 1794, FP = 1481)
class_id = 5, name = truck, ap = 43.74% (TP = 1893, FP = 1000)
class_id = 6, name = tricycle, ap = 4.56% (TP = 3, FP = 9)
class_id = 7, name = awning-tricycle, ap = 3.54% (TP = 0, FP = 1)
class_id = 8, name = bus, ap = 31.48% (TP = 360, FP = 219)
class_id = 9, name = motor, ap = 9.29% (TP = 917, FP = 1936)
class_id = 10, name = bulldozer, ap = 83.49% (TP = 94, FP = 21)
class_id = 11, name = 12, ap = 31.54% (TP = 0, FP = 0)
class_id = 12 through class_id = 79 (names 13 through 80): ap = 0.00% (TP = 0, FP = 0) for every one of these unused class ids

for conf_thresh = 0.25, precision = 0.58, recall = 0.36, F1-score = 0.44
for conf_thresh = 0.25, TP = 38296, FP = 28201, FN = 68121, average IoU = 42.32 %

IoU threshold = 50 %, used Area-Under-Curve for each unique Recall
mean average precision (mAP@0.50) = 0.041633, or 4.16 %
Total Detection Time: 213 Seconds

Set -points flag:
-points 101 for MS COCO
-points 11 for PascalVOC 2007 (uncomment difficult in voc.data)
-points 0 (AUC) for ImageNet, PascalVOC 2010-2012, your custom dataset

mean_average_precision (mAP@0.5) = 0.041633

2. test results: 1480120 jpg

1480113 jpg

3. cfg file:

[net]
# Testing
# batch=1
# subdivisions=1
# Training
batch=64
subdivisions=16
width=416
height=416
channels=3
momentum=0.9
decay=0.0005
angle=0
saturation = 1.5
exposure = 1.5
hue=.1

learning_rate=0.001
burn_in=1000
max_batches = 60000
policy=steps
steps=10000,20000
scales=.1,.1

[convolutional] batch_normalize=1 filters=16 size=3 stride=1 pad=1 activation=leaky

[maxpool] size=2 stride=2

[convolutional] batch_normalize=1 filters=32 size=3 stride=1 pad=1 activation=leaky

[maxpool] size=2 stride=2

[convolutional] batch_normalize=1 filters=64 size=3 stride=1 pad=1 activation=leaky

[maxpool] size=2 stride=2

[convolutional] batch_normalize=1 filters=128 size=3 stride=1 pad=1 activation=leaky

[maxpool] size=2 stride=2

[convolutional] batch_normalize=1 filters=256 size=3 stride=1 pad=1 activation=leaky

[maxpool] size=2 stride=2

[convolutional] batch_normalize=1 filters=512 size=3 stride=1 pad=1 activation=leaky

[maxpool] size=2 stride=1

[convolutional] batch_normalize=1 filters=1024 size=3 stride=1 pad=1 activation=leaky

###########

[convolutional] batch_normalize=1 filters=256 size=1 stride=1 pad=1 activation=leaky

[convolutional] batch_normalize=1 filters=512 size=3 stride=1 pad=1 activation=leaky

[convolutional] size=1 stride=1 pad=1 filters=255 activation=linear

[yolo] mask = 6,7,8 anchors = 10,13, 18,25, 33,23, 31,58, 62,45, 59,119, 116,90, 156,198, 313,263

anchors = 4,7, 7,15, 13,25, 25,42, 41,67, 75,94, 91,162, 158,205, 250,332

classes=80 num=9 jitter=.3 ignore_thresh = .5 truth_thresh = 1 random=1

[route] layers = -4

[convolutional] batch_normalize=1 filters=128 size=1 stride=1 pad=1 activation=leaky

[upsample] stride=2

[route] layers = -1, 8

[convolutional] batch_normalize=1 filters=256 size=3 stride=1 pad=1 activation=leaky

[convolutional] size=1 stride=1 pad=1 filters=255 activation=linear

[yolo] mask = 3,4,5 anchors = 10,13, 18,25, 33,23, 31,58, 62,45, 59,119, 116,90, 156,198, 313,263

anchors = 4,7, 7,15, 13,25, 25,42, 41,67, 75,94, 91,162, 158,205, 250,332

classes=80 num=9 jitter=.3 ignore_thresh = .5 truth_thresh = 1 random=1

[route] layers = -3

[convolutional] batch_normalize=1 filters=128 size=1 stride=1 pad=1 activation=leaky

[upsample] stride=2

[route] layers = -1, 6

[convolutional] batch_normalize=1 filters=128 size=3 stride=1 pad=1 activation=leaky

[convolutional] size=1 stride=1 pad=1 filters=255 activation=linear

[yolo] mask = 0,1,2 anchors = 10,13, 18,25, 33,23, 31,58, 62,45, 59,119, 116,90, 156,198, 313,263

anchors = 4,7, 7,15, 13,25, 25,42, 41,67, 75,94, 91,162, 158,205, 250,332

classes=80 num=9 jitter=.3 ignore_thresh = .5 truth_thresh = 1 random=1

kame-lqm commented 4 years ago

Hi Alexey, the following log seems weird to me: it shows "class_id = 11, name = 12, ap = 31.54%", but in fact there is no bounding box with class name "12" in my dataset, so its AP should be 0%. Is it a bug?

Training log:

detections_count = 1148086, unique_truth_count = 106417
class_id = 0, name = pedestrian, ap = 10.24% (TP = 2580, FP = 5155)
class_id = 1, name = people, ap = 26.38% (TP = 3212, FP = 2791)  # class id from dataset 'WiderPerson'; the other class ids are all from dataset 'Visdrone2018'
class_id = 2, name = bicycle, ap = 2.19% (TP = 23, FP = 57)
class_id = 3, name = car, ap = 57.78% (TP = 27420, FP = 15531)
class_id = 4, name = van, ap = 28.83% (TP = 1794, FP = 1481)
class_id = 5, name = truck, ap = 43.74% (TP = 1893, FP = 1000)
class_id = 6, name = tricycle, ap = 4.56% (TP = 3, FP = 9)
class_id = 7, name = awning-tricycle, ap = 3.54% (TP = 0, FP = 1)
class_id = 8, name = bus, ap = 31.48% (TP = 360, FP = 219)
class_id = 9, name = motor, ap = 9.29% (TP = 917, FP = 1936)
class_id = 10, name = bulldozer, ap = 83.49% (TP = 94, FP = 21)
class_id = 11, name = 12, ap = 31.54% (TP = 0, FP = 0)
class_id = 12, name = 13, ap = 0.00% (TP = 0, FP = 0)
class_id = 13, name = 14, ap = 0.00% (TP = 0, FP = 0)

data.names:

pedestrian people bicycle car van truck tricycle awning-tricycle bus motor bulldozer 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80
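One quick way to check which class ids actually occur in the label files (in YOLO label format, the first field of each line is the class id) is a small shell pipeline like the following; the label directory name is a placeholder:

    # count how many boxes exist for every class id across all training labels
    cat train_labels/*.txt | awk '{print $1}' | sort -n | uniq -c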

AlexeyAB commented 4 years ago

Does anyone know what is happening?

  • Because you trained the model at 544x544, the best results will be at 544x544 or slightly higher.
  • Because you use yolov3-tiny_3l.cfg, which has very low accuracy.

The following log seems weird to me: it shows "class_id = 11, name = 12, ap = 31.54%", but in fact there is no bounding box with class name "12" in my dataset, so its AP should be 0%. Is it a bug?

The mistake is in your dataset/coco.names.
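For reference, a .names file holds one class name per line, in class-id order, and should line up with classes= in the cfg. If the cfg were set to classes=11, a file matching the classes named earlier in this thread would simply be:

    pedestrian
    people
    bicycle
    car
    van
    truck
    tricycle
    awning-tricycle
    bus
    motor
    bulldozer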

kame-lqm commented 4 years ago

Does anyone know what is happening?

  • Because you trained the model at 544x544, the best results will be at 544x544 or slightly higher.
  • Because you use yolov3-tiny_3l.cfg, which has very low accuracy.

The following log seems weird to me: it shows "class_id = 11, name = 12, ap = 31.54%", but in fact there is no bounding box with class name "12" in my dataset, so its AP should be 0%. Is it a bug?

The mistake is in your dataset/coco.names.

@AlexeyAB Thanks so much.

kame-lqm commented 4 years ago

@AlexeyAB I already found out what happened with this issue. When I used the "pjreddie/darknet" source code to test my cfg & weights, I got the wrong results mentioned above. When I used your source code to test them, the result is perfect.

CORRECT TEST RESULT test_result

Fetulhak commented 2 years ago

@kame-lqm I have trained two YOLOv4 models, one at resolution 416x416 and the other at 512x512. However, the model trained at 512x512 has a lower mAP than the one at 416x416. It is confusing to me; shouldn't it be the opposite? The input images were all the same size, 1008x1008. Any help will be appreciated.

Anchors generated at 416: 10,10, 18,8, 8,18, 12,12, 14,14, 16,15, 18,18, 21,21, 25,25 (avg IoU = 90.52%)
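For reference, anchors like these are typically produced with the repo's calc_anchors utility, which also reports the average IoU; a sketch of the invocation (the .data file name is a placeholder):

    ./darknet detector calc_anchors data/obj.data -num_of_clusters 9 -width 416 -height 416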