Closed mahxn0 closed 5 years ago
The output of the depth can be interpolated to the full size. Generally the memory limit decides how large the dense estimation would be.. in this case its half the input.
For the KITTI model you are on the right track I think. Let me know if you encounter errors.
Hey what about training my own dataset of 1920x1080 images? Do I need to resize them? Where in the code do I need to make changes? Please help since I'm stuck on this for days.
This training bug does not seem to go away.. Also when I do some changes, the training exits on first epoch
My dataset consists of 51 training images and 654 test images. Batch size is 4.
You have different options to fix this. Either you simply resize all inputs to (384, 1248, 3) (height, width, channel) and do refinements on our already trained model or you could try to compute the best values from the original data that can be divided by 32 (I think in this case just cropping the height to 1056).
Which approach is best depends on many things and its hard to predict before hand. One thing to keep in mind is the batch size. If your card doesn't allow for more than 4 because of this larger image think about resizing until it fit. I found that batch size of 8 usually gives out the best result.
So I leave the width as it is and only resize the height to 1056.. so will my final training image be 1920x1056? And set the depth shape to half this value right?
Exactly.
You have different options to fix this. Either you simply resize all inputs to (384, 1248, 3) (height, width, channel) and do refinements on our already trained model or you could try to compute the best values from the original data that can be divided by 32 (I think in this case just cropping the height to 1056).
Which approach is best depends on many things and its hard to predict before hand. One thing to keep in mind is the batch size. If your card doesn't allow for more than 4 because of this larger image think about resizing until it fit. I found that batch size of 8 usually gives out the best result.
Okay, will give this a try
Hi, I've done as you said but the training wouldn't start. I have put a print statement in the on_epoch_end
callback function but the training isn't reaching there.
Following is my log:
The training automatically exits after the last line with no errors thrown.
thanks for your great work I can't understand why shape_rgb is 640480, the shape_depth is 320240?
If I used kitti.h5 to fintuning,my dataset is 1280720, First I resized all rgb and depth to 3701224?
then there some code do I need to modify ?
there is my dataload code:
look forward you reply ^_^