aspose-ocr / Aspose.OCR-for-.NET

Aspose.OCR for .NET examples, plugins and showcase projects
https://products.aspose.com/ocr/net
MIT License
43 stars 34 forks source link

OCR not reading tiff file effectively #25

Closed ShekharDewan closed 6 years ago

ShekharDewan commented 6 years ago

The included tiff file (SampleTiff.tiff) does not seem to be well processed by the OCR. I tried targetting it with PerformOCROnImage.Run() and tried just uncommenting OCROnMultipageTIFF.Run(), both with lackluster results. Both output only one line (similar in both cases), and its unclear where the line is coming from in the tiff file.

kashifiqb commented 6 years ago

@RoomTemprature ,

We are analyzing your mentioned issues and will soon update you with our findings here.

m-ikramulhaq commented 6 years ago

@RoomTemprature,

You are correct. Output after performing Aspose.OCR operation on this image is not correct. This image has colored background. At the moment, Aspose.OCR has issues with colorful backgrounds. This issue has already been logged into our system.

We are sorry for the inconvenience.

ShekharDewan commented 6 years ago

Thanks for the update. The OCR also seems to struggle with crowded images, such as sample1.jpg, but I'm guessing that's not a simple fix either. That image doesn't have a colored back ground.

kashifiqb commented 6 years ago

We will look into it along with the issue already reported earlier.

m-ikramulhaq commented 6 years ago

This image can be used with User Defined Recognition Block option [AddRecognitionBlock]. Further, the image contains data in tabular format. Please note that current implementation does not support extracting data from table format. This issue has also been logged into our system.