Closed felixgwu closed 7 years ago
Hi, thank you for your feedback. Please refer to the PR #10 You must use fast_compile to be able to compile the model and optimizer_including=fusion to enable an important optimization for a speedup during training
Hi,
I was impressed by your excellent work and tried implement a PyTorch version of it. Unfortunately, I have trouble when reproducing your results.
I followed the setting as follows:
I found that without dropout the model learns much better than the one with the one with dropout. However, none of these settings can get the same accuracy as what reported in the paper. The validation accuracy is 0.9372 and the mIoU is 0.7025; however, the test accuracy is only 0.8932 and the test mIoU is 0.5790.
I am wondering what is the data preprocessing method you used. Is there anything wrong with my experiment procedure?
Also, I tried to run your code with my implementation of dataloader (following your explanation of data format). However I got
RuntimeError: error getting worksize: CUDNN_STATUS_BAD_PARAM
It suggested me use 'optimizer=None', but it took me more than 3 hours to compile the model before I killed the job. FYI, I used the latest theano and lasagne with CUDA 8.0 and CUDNN 5.1. Do I have to use a different version?Thanks in advance.