FreeUKGen / SummerOfCodeImages

Base images and gold standard datasets for Summer of Code projects involving computer vision and image classification/segmentation.
Apache License 2.0
4 stars 4 forks source link

Adding image-splitter that uses Canny-Edge detection. #4

Open iamgroot42 opened 6 years ago

iamgroot42 commented 6 years ago

Addresses issue https://github.com/FreeUKGen/SummerOfCodeImages/issues/1

Using the fact that pages are more-or-less of the same size, and that there is some space between them, I have to build a method to split such given images into two pages. This method does not require any training or annotations. Based on the assumption that the images won't be too rotated, it will give good results. Some visualizations from the given sample dataset are:

Left half: image

Right half: image

This method is quite fast: it splits all the given images in a matter of seconds and does not require high-end GPUs or deep machine-learning models (or humongous data) to train it.