Closed: chuhang closed this issue 8 years ago
In my experience the CUBLAS error occurs when your label data contains values outside the range of your output classes. If you are outputting 5 classes from your final convolutional layer, then your label data should be in the range 0 - 4 (inclusive).
Running in CPU mode often gives a more descriptive error, too.
Alex
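For anyone wanting to run the label-range check described above, here is a minimal sketch. It assumes the label values of one image have already been read into a flat sequence of integers; the function name `out_of_range_labels` and the `ignore_label` parameter are hypothetical and not part of Caffe or SegNet.

```python
from collections import Counter


def out_of_range_labels(pixels, num_classes, ignore_label=None):
    """Count label values that fall outside 0 .. num_classes - 1.

    pixels: flat iterable of integer label values, one per pixel.
    ignore_label: optional value to skip (an assumption; only useful if
    your loss layer is configured to ignore that label).
    """
    counts = Counter(pixels)
    return {
        value: n
        for value, n in counts.items()
        if value != ignore_label and not (0 <= value < num_classes)
    }


# Hypothetical flattened label map for a 5-class problem, with one stray 255.
labels = [0, 1, 1, 4, 255, 2]
print(out_of_range_labels(labels, num_classes=5))  # {255: 1}
```

An empty result means every label fits in the range the network can output. Any non-empty result lists the offending values and how many pixels carry them, which is usually enough to find the bad label images.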
Hello, @alexgkendall. I have the same issue as above. I labelled my data from 0 - 4 (inclusive), but I cropped the images, and as a result some of them contain only some of the classes (for instance, only 0 and 1). Could this be the source of the problem?
No, it is fine to have only some of the classes present; this is the case in CamVid too.
Could you please paste your model file?
Hello, @alexgkendall. Thank you for your quick responses! Actually, the issue may have been with our Caffe configuration. On our server, we re-installed caffe-segnet and re-ran the experiment, and it worked!
Thank you again!
Hi,
I am trying to train a segmentation network with 5 categories. I have prepared my images and labels and modified the paths.
The only changes I made to the segnet_train.prototxt file are to num_output and loss_param.
However, when I start training I get either 'caffe free(): invalid next size' or 'CUBLAS_INTERNAL_ERROR'.
I can run the training command listed in the tutorial without error.
What could be the problem here? Thanks in advance!