yuxiangsun / RTFNet

RGB-Thermal Fusion Network for Semantic Segmentation of Urban Scenes
MIT License
166 stars 39 forks source link

hello, i have a question #9

Closed yeong5366 closed 4 years ago

yeong5366 commented 4 years ago

i trained with your code and didnt touch any codes but the performance was much worse than your result in paper. The best recall value was 0.55 and best iou value was 0.45. i wonder if there is any other methods to be used in training. Especially the class 'bump' recall and iou value is super lower (about 0.005) plus, how many epochs did you train with your code?

Thanks for this nice paper :)

yuxiangsun commented 4 years ago

You may want to train again. I train until the val loss converges.

yeong5366 commented 4 years ago

i appreciate your kind answer. when i checked tensorboard, val loss oscillated but train loss converged. I thought the problem was overfitting to training dataset. (it was about 300 epochs) maybe i waited more time. Thanks. By the way, README says that the mf dataset used in this paper was preprocessed. Can i ask what becomes different from original MFnet dataset?

yuxiangsun commented 4 years ago

mainly the flip operation

SunnyWuYang commented 2 years ago

@yeong5366 Hello, I encountered the same problem as you, I trained ResNet50(pertained) more than once, with the same configuration as the paper (batch size=3, start lr=0.01), but every time the performance is lower than the paper ( mAcc is around 0.54, mIoU is around 0.44), did you finally achieve the performance described in the paper? If it does, what configuration are you using? And how many times did you train in the end? Thanks!

wylbd commented 2 years ago

Thank you for your letter. i will reply your letter as soon as I can! best wishes! yours sincerelyYulun Wu?xml:namespace> Department of Mechanical Engineering,Zhejiang University