Closed danieldimit closed 4 years ago
After some experiments, I found out that lowering the learning rate in yolo-pose-pre.cfg
prevents this error from happening. But the question still stands: Has anyone done pretraining successfully? I tried it but it never actually made a model better than the first one it created (which had acc 0), so it was useless.
We use the pretraining to be able to have reasonable confidence ground-truth when we start the actual training. The following part in the paper also explains this:
As the pose estimates in the early stages of training are inaccurate, the confidence values computed using Eq. 1 are initially unreliable. To remedy this, we pretrain our network parameters by setting the regularization parameter for confidence to 0. Subsequently, we train our network ...
We found it effective to pretrain the model without confidence estimation first and fine-tune the network later on with confidence estimation as well. You can also still train the network from a more crude initialization (with weights trained on ImageNet). However, this usually results in a slower convergence (sometimes even worse convergence, i.e. lower accuracy).
Hi, I am trying to train the network to predict the position of a single object. I am doing the pretraining procedure you've mentioned using the
yolo-pose-pre.cfg
file. I've changed the bottom part of theyolo-pose-pre.cfg
file so that it would infere only 1 class:But every time I try to pretrian (even on the APE object from the LINEMOD datase) at around ~600 images iterated the loss becomes really big really fast until it becomes NaN:
Is this normal and why would it happen? Here is my whole yolo-pose-pre.cfg file that I use.