nyanp / mnisten

convert image files to mnist idx format
61 stars 30 forks source link

Creating idx format for my dataset #15

Open uzair789 opened 7 years ago

uzair789 commented 7 years ago

Hi, I have a dataset that I have created of gray scale images which i want to use with the siamese network example in caffe in which the documentation uses mnist. I want to replace the mnist dataset with my own dataset

I see that for doing this I need my dataset to be in the format required by the siamese netwrk. This can be created using the 'create_mnist_siamese.sh' which loads the mnist dataset in the idx3-ubyte format and creates a dataset lmdb database with two images and a matching/non matching label in each location of the lmdb database.

So I figured for me to use the 'create_mnist_siamese.sh' script, my dataset also needs to be in the idx-ubyte format. I tried to convert my dataset to the idx-ubyte format using 'mnisten'. However i get the error 'error:total images are less than num_tests'. I guess the script is not identifying my images. The folder structure of the dataset is like this:

parent-directory

parent directory name - 'generated dataset' subfolders - 1 ,2 ,3 ... (the subfolders are titled 1 - 30 as I want to label the data in each subfolder by the name of the subfolder) The txt file contains image title on each row with the class label.

How do I work with my dataset on the siamese network in caffe? Is there a direct way to convert my dataset to the lmdb format for the siamese network? Or do I have to use mnisten? If I do then how do I fix my error ? Anu help will be much appreciated. Thanks.

lenghanyan commented 5 years ago

请问一下,你将自己的数据集制作成了idx-ubyte格式吗??