abhisheks008 / DL-Simplified

Deep Learning Simplified is an Open-source repository, containing beginner to advance level deep learning projects for the contributors, who are willing to start their journey in Deep Learning. Devfolio URL, https://devfolio.co/projects/deep-learning-simplified-f013
https://quine.sh/repo/abhisheks008-DL-Simplified-499023976
MIT License
319 stars 288 forks source link

Separating text from image #681

Open harshmishra19 opened 1 month ago

harshmishra19 commented 1 month ago

Deep Learning Simplified Repository (Proposing new issue)

:red_circle: Separating text from image :
:red_circle:Aim of the project is to provide users with a code that can help them take out text from images and automatically get an report generated on that image :
:red_circle: Dataset :
:red_circle: Using some libraries like pytesseract : Try to use 3-4 algorithms to implement the models and compare all the algorithms to find out the best fitted algorithm for the model by checking the accuracy scores. Also do not forget to do a exploratory data analysis before creating any model.


πŸ“ Follow the Guidelines to Contribute in the Project :


:red_circle::yellow_circle: Points to Note :


:white_check_mark: To be Mentioned while taking the issue :


Happy Contributing πŸš€

All the best. Enjoy your open source journey ahead. 😎

github-actions[bot] commented 1 month ago

Thank you for creating this issue! We'll look into it as soon as possible. Your contributions are highly appreciated! 😊

abhisheks008 commented 1 month ago

Please share the dataset and your approach for solving this issue. @harshmishra19

harshmishra19 commented 1 month ago

yea here's the breakdown

Image Used

designelements

and here is the result image during processing result_image

And here is the report Generated reportgen

abhisheks008 commented 1 month ago

What are the deep learning techniques you are planning to implement here?

harshmishra19 commented 1 month ago

Currently I am using Pytesseract which is a Python wrapper for Google's Tesseract-OCR Engine, which is an optical character recognition (OCR) tool used to extract text from images. But I can opt for Convolutional Neural Networks (CNNs) EAST (Efficient and Accurate Scene Text Detector): This model can detect text in natural scenes.

abhisheks008 commented 1 month ago

Currently I am using Pytesseract which is a Python wrapper for Google's Tesseract-OCR Engine, which is an optical character recognition (OCR) tool used to extract text from images. But I can opt for Convolutional Neural Networks (CNNs) EAST (Efficient and Accurate Scene Text Detector): This model can detect text in natural scenes.

Implement all the above mentioned models for this dataset and find out the best fitted one based on the accuracy scores of the implemented models/methods.

Assigned @harshmishra19