src-
which contains code files
testimages -
which contains testing images
result -
it contains JSON object file which contains extracted information
Usage:
python id_card_detail_extract.py
Output will be JSON object name
Steps:
First, our Faster RCNN model tries to Identity Regions of Interest (ROI) containing the required information with deep learning.
If a card is detected it creates a bounding box around it and crops that part of the image.
That cropped image is then fed into our OpenCV and pytesseract model where we perform text extraction.
Our model extracts information such as Name, Gender, Mobile No, UID and Aadhar no. from the image.
The Extracted information is then printed and fed into a JSON file, where it is saved.
Model Accuracy and Performace:
The accuracy of our model mostly depends on the quality of the image as well as the orientation. (The model presently fails to extract information from tilted or inverted image)
The Faster RCNN model is able to achieve accuracy of more than 90% for detecting a card in the image but also makes the process a bit slower.
But, For extracting Text from the Image the model needs to be customized as per the nature of different cards like Aadhar Card, Driving License, Pan Card, etc.
(The above steps were implemented for extracting Information from Aadhar Card and the model works well for it)
Structure and Usage
Directories:
Usage:
Steps:
Model Accuracy and Performace:
(The above steps were implemented for extracting Information from Aadhar Card and the model works well for it)