SimJeg / FC-DenseNet

Fully Convolutional DenseNets for semantic segmentation.
487 stars 143 forks source link

Problem of reproducing the results #11

Closed felixgwu closed 7 years ago

felixgwu commented 7 years ago

Hi,

I was impressed by your excellent work and tried implement a PyTorch version of it. Unfortunately, I have trouble when reproducing your results.

I followed the setting as follows:

I found that without dropout the model learns much better than the one with the one with dropout. However, none of these settings can get the same accuracy as what reported in the paper. The validation accuracy is 0.9372 and the mIoU is 0.7025; however, the test accuracy is only 0.8932 and the test mIoU is 0.5790.

I am wondering what is the data preprocessing method you used. Is there anything wrong with my experiment procedure?

Also, I tried to run your code with my implementation of dataloader (following your explanation of data format). However I got RuntimeError: error getting worksize: CUDNN_STATUS_BAD_PARAM It suggested me use 'optimizer=None', but it took me more than 3 hours to compile the model before I killed the job. FYI, I used the latest theano and lasagne with CUDA 8.0 and CUDNN 5.1. Do I have to use a different version?

Thanks in advance.

SimJeg commented 7 years ago

Hi, thank you for your feedback. Please refer to the PR #10 You must use fast_compile to be able to compile the model and optimizer_including=fusion to enable an important optimization for a speedup during training