Align image of pages - Githubissues

rishiraj commented 3 years ago

Goal

Align image of pages that are in different orientations using keypoint matching and a homography matrix so that we can apply OCR to the texts in the page.

Considerations

We'll be developing a generalized document scanner so there won't be a special text filled template to extract keypoints for matching. A perfectly aligned empty page might be a good template. We can then compute a homography matrix, which allows us to apply a perspective warp to align the image of pages.

Deliverables

A Colab Notebook to demonstrate the idea.
A Python script (you can modularize code with multiple scripts too) for the end-to-end execution i.e. this script will take an image of a page as its input, align it in proper orientation, and display the aligned image on the screen.

Tools

You are free to use open-source pre-trained models. If you use someone else's please attribute it. If your code is plagiarized then you will be suspended (applicable only if you are a WoC participant).

This template was adapted from Deep Fusion AI's organization template: thank you Sayak Paul for writing it!

carrycooldude commented 3 years ago

I want to work on this issue and I am Selected for this repo

Abhinav-2901 commented 3 years ago

My idea to overcome this issue is by reordering the the detected corner points of the greatest contour detected so that we can provide those corner points in the correct order for wraping. I have created a reorder function in my python file. I used simple numpy library and some basic maths to do that. Sample output and python file is attached with the comment. test

rishiraj commented 3 years ago

My idea to overcome this issue is by reordering the the detected corner points of the greatest contour detected so that we can provide those corner points in the correct order for wraping. I have created a reorder function in my python file. I used simple numpy library and some basic maths to do that. Sample output and python file is attached with the comment.

Hey @Abhinav-2901 I can't find the Python code you have mentioned here. Please upload it.

Abhinav-2901 commented 3 years ago

Hey, I have created a pull request in which my python file is present. Do I need to comment the code also?

On Tue, 22 Dec, 2020, 10:35 pm Rishiraj Acharya, notifications@github.com wrote:

My idea to overcome this issue is by reordering the the detected corner points of the greatest contour detected so that we can provide those corner points in the correct order for wraping. I have created a reorder function in my python file. I used simple numpy library and some basic maths to do that. Sample output and python file is attached with the comment. [image: test] https://user-images.githubusercontent.com/66773716/102691818-63343d80-4235-11eb-8a15-15b2236cf382.jpeg

Hey @Abhinav-2901 https://github.com/Abhinav-2901 I can't find the Python code you have mentioned here. Please upload it.

— You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub https://github.com/dscnsec/DocScanner/issues/2#issuecomment-749661893, or unsubscribe https://github.com/notifications/unsubscribe-auth/AP5OFVDW3KGQ6DJGH35F26LSWDGWTANCNFSM4UWWFMCQ .

dscnsec / DocScanner

Align image of pages #2

Goal

Considerations

Deliverables

Tools