Closed bao-O closed 5 years ago
Here's my loss chart. I'm very confused about it
how large is your dataset? I've found that with very large, very challenging datasets, the loss just doesn't really converge beyond a certain point.
I used WIDER FACE dataset with training set including 12880 images, validating set including 3200 images. Is problem in dataset?
not a problem - just a similar result to me - you have a challenging dataset.
Was your final model enough good? I guess I'll training up to 50k iterators, if there will be no improvement, I change to use yolov3
yeah it was ok - how come ap isn't being plotted on your chart? what command did you run to train with? It's a lot easier to see what's going on when you can see the progress of ap as it trains.
btw yolov3-spp is better value than yolov3
Oh I didn't use -map flag when writing training command. It gets this error when I train with -map flag : "CUDA error: out of memory". Next time I'll try to use it.
Hey, @LukeAI, thanks for your responses to my own questions. Would you mind if I have another question for you?. I already have a trained model of yolov3 and a model of yolov3-tiny - what I'm I training. I knew the speed of yolov3 is slow on CPU, and in my computer, it was too close so the FPS when detecting face in a video always is very low (< 10fps). Can we speed up that or we must use yolov3-tiny instead?
If you are getting "CUDA error: out of memory" - you can decrease memory use by increasing subdivisions in the .cfg
You can speed up yolov3 by decreasing the width and height in the .cfg (multiple of 32) - that's easy so try that first, maybe you will get acceptable accuracy and FPS. You may also find that some other implementation is faster - on the cpu, opencv-dnn is faster than darknet, for example. You could also try one of these alternate network structures https://github.com/AlexeyAB/darknet/issues/3114#issuecomment-494148968
I've run a lot of experiments with them and found tiny-yolov3-pan2 to be a good mid-way point between tiny-yolo and yolov3-spp in terms of AP and FPS
I'm training my yolov3 tiny model for face detection. I followed steps that were described in this repo, but after 9000 iterators, I relized that loss avg was abnormal. It stayed around 4. This is what I got when use command
./darknet detector map
with yolov3-tiny_9000.weights:My config file - yolov3-tiny.cfg:
Those archors are what I got when running
./darknet detector calc_anchors data/voc.data -num_of_clusters 6 -width 416 -height 416
Is this Underfitting? Should I train more iterators, modify my .cfg file, or select other pretrained model and config? Thanks in advance.