DeyvidKochanov-TomTom / kprnet

MIT License
65 stars 12 forks source link

Some questions about details. #4

Closed zxczrx123 closed 2 years ago

zxczrx123 commented 3 years ago

Thanks very much for your work!I note that "during training random crops of width 1025 are sampled from the range images" and I'm curious if models trained in this way only and act directly on the raw data will cause a performance loss. I have trained about 80 epochs with default settings and got 61.8 on the validation set, which differs from the 64.1 in the paper, and I would like to know how to improve the performance further.Can you offer some advice?

DeyvidKochanov-TomTom commented 3 years ago

Hi, my best guess is the pre-trained weights weren't loaded correctly. Could you change this line https://github.com/DeyvidKochanov-TomTom/kprnet/blob/master/train_kitti.py#L73 to True and see what happens? The expected outcome is that it will crash but you should see a list of the weights which were not loaded from the checkpoint. We ran one experiment with the code from the repo before publishing it reaches 63+ mIoU fairly fast. Way faster than 80 epochs In the end there was a small deviation from the result in the paper but it was about 0.2 mIoU so it could be just variance. image

zxczrx123 commented 3 years ago

Sorry, I overlooked my change to the batch size.For fast training,I use batch size=7 with 8 V100 32G,I think that might be the reason for the difference.Do you think so?I'll also verify if the pre-training loads correctly afterwards.

DeyvidKochanov-TomTom commented 3 years ago

I guess doing this okay but then you need to change the learning rate and OHEM threshold