jzbontar / mc-cnn

Stereo Matching by Training a Convolutional Neural Network to Compare Image Patches
BSD 2-Clause "Simplified" License
707 stars 232 forks source link

what is the preprocessing? #34

Open sooyeonshin opened 7 years ago

sooyeonshin commented 7 years ago

can i ask you about your work?

I don't understand about preprocessing with kitti data set.

their are several outs exist like x0.bin, x1.bin, metadata, nnz_tr, nnz_te...

look's like x0.bin is the image pixel data and metadata is the image imformation. but I can't understand what is tr, te, nnz_tr and nnz_te.

nnz_tr = torch.FloatTensor(23e6, 4)?

what is 23e6 meaning?

jzbontar commented 7 years ago
Qurey commented 7 years ago

@sooyeonshin @jzbontar I don't understand about preprocessing with middlebury data set in the preprocess_mb.py mask = cv2.imread('tmp/mask.png', 0) disp0[mask != 255] = 0 y, x = np.nonzero(mask == 255). what is the function of mask and what is mask meaning?

jzbontar commented 7 years ago

The mask stores information about which areas are occluded. Try opening the generated mask.png and it will make sense.