Udayraj123 / OMRChecker

Evaluate OMR sheets fast and accurately using a scanner 🖨 or your phone 🤳.
GNU General Public License v3.0
709 stars 295 forks

[Feature] Optical Character Reader [OCR] #99

Open shreekrishnaacharya opened 1 year ago

shreekrishnaacharya commented 1 year ago

Can we also add an optical character reader (OCR) to this application for reading student names and other details?

Udayraj123 commented 1 year ago

Yes, we have a related issue to add a plugin for the same. Can you help me formulate the problem with more clarity?

We need help in generalizing the approach.

Adwait01-Bhosale commented 1 year ago

Hello, I can contribute to this issue. I have worked many times with Tesseract OCR, Vision API OCR, and Microsoft OCR, where I improved the OCR results with computer vision techniques such as grayscaling the image and adjusting its contrast so the text is read accurately.
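For readers following along, the two preprocessing steps mentioned above (grayscaling and contrast adjustment) can be sketched roughly as below. This is only an illustration, not code from OMRChecker or from any of the OCR tools named: the function names `to_grayscale` and `stretch_contrast` are hypothetical, the image is assumed to be a plain list of rows of `(r, g, b)` tuples, and linear contrast stretching is just one of several possible ways to "give proper contrast". A real pipeline would more likely use an image library; plain lists keep the sketch dependency-free.

```python
def to_grayscale(rgb_image):
    """Convert rows of (r, g, b) tuples into rows of 0-255 gray levels."""
    # Integer form of the common 0.299 / 0.587 / 0.114 luma weights.
    return [
        [(299 * r + 587 * g + 114 * b) // 1000 for (r, g, b) in row]
        for row in rgb_image
    ]


def stretch_contrast(gray_image):
    """Linearly rescale gray levels so the darkest pixel maps to 0
    and the brightest maps to 255 (assumed contrast method)."""
    lo = min(min(row) for row in gray_image)
    hi = max(max(row) for row in gray_image)
    if hi == lo:
        # A flat image has no contrast to stretch; return a copy unchanged.
        return [row[:] for row in gray_image]
    return [[(p - lo) * 255 // (hi - lo) for p in row] for row in gray_image]


# Hypothetical 2x2 low-contrast scan: faint text on a gray background.
scan = [
    [(100, 100, 100), (150, 150, 150)],
    [(120, 120, 120), (200, 200, 200)],
]
gray = to_grayscale(scan)
enhanced = stretch_contrast(gray)
```

The enhanced image would then be handed to whichever OCR engine is chosen; how much this helps depends on the scan quality and the engine.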

shreekrishnaacharya commented 1 year ago

Thank you, Adwait01. I am also working on this OCR part and would appreciate your contribution. I am working on my thesis and need to read documents that consist of marks and some text characters, so your contribution would be very helpful. Also, is it possible to detect cross marks and remove them from the image, or replace them with white? This would help remove wrongly marked answer options. Thank you.

Udayraj123 commented 1 year ago

Hi @Adwait01-Bhosale, that's great to hear! Can you and @shreekrishnaacharya join me on Discord to discuss possible solutions?

Udayraj123 commented 1 year ago

Here is a related issue for which I have framed a solution - https://github.com/Udayraj123/OMRChecker/issues/73

Just like extracting text from an area, that issue requires extracting a passport photo from an area (using page cropping).

shreekrishnaacharya commented 1 year ago

https://github.com/Udayraj123/OMRChecker/issues/99#issuecomment-1328764312 @Udayraj123, @Adwait01-Bhosale my Discord account is sksharma72000. I have worked with Tesseract OCR; it detects printed text well, but it is not suitable for handwriting detection.

Udayraj123 commented 1 year ago

That's fine for a start. I have a requirement from a business for the same. You can help read printed roll numbers or booklet numbers in an area of the image. Later we can extend it to handwriting detection.
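To make the "read a printed number in an area of the image" idea concrete, here is a minimal sketch under stated assumptions. It is not OMRChecker's API: `crop_region`, `read_number_field`, and the `(top, left, height, width)` region layout are hypothetical names chosen for illustration, and `recognize` is a placeholder for whatever OCR engine ends up being used (for example a thin wrapper around Tesseract), passed in as a callable so the cropping logic stays independent of the engine.

```python
def crop_region(image, top, left, height, width):
    """Return the sub-image covering the given rectangle.

    `image` is assumed to be a list of equal-length pixel rows.
    """
    rows = len(image)
    cols = len(image[0]) if rows else 0
    if (top < 0 or left < 0 or height <= 0 or width <= 0
            or top + height > rows or left + width > cols):
        raise ValueError("region lies outside the image")
    return [row[left:left + width] for row in image[top:top + height]]


def read_number_field(image, region, recognize):
    """Crop `region` from `image`, run the supplied OCR callable on it,
    and keep only the digits of the recognized text."""
    top, left, height, width = region
    text = recognize(crop_region(image, top, left, height, width))
    return "".join(ch for ch in text if ch.isdigit())


# Usage with a stand-in OCR function (a real engine would go here).
def fake_ocr(sub_image):
    # Pretend the engine read a roll number with some stray characters.
    return " Roll: 20-4517\n"


page = [[0] * 8 for _ in range(6)]          # hypothetical 6x8 blank page
roll_number = read_number_field(page, (1, 2, 3, 4), fake_ocr)
```

Filtering to digits suits roll numbers and booklet numbers specifically; fields such as student names would need a different cleanup step.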

Udayraj123 commented 1 year ago

@shreekrishnaacharya @Adwait01-Bhosale let me know if you folks have made any progress on this task. If so, others may decide to build on top of it.

shreekrishnaacharya commented 1 year ago

Yes, I have an update on it, and I am already making use of it as well. But I don't know how I can share this work with you. Let me look into that, or please reply with instructions.

Udayraj123 commented 1 year ago

@shreekrishnaacharya you can create a PR with running code on one of the samples (make sure to pull the latest master first).

P.S. Or, if you're available to discuss your approach first, you can tag me on the Discord voice channel.