A9T9 / Free-Ocr-Windows-Desktop

Free open-source OCR application for the Windows Desktop - A modern GUI front-end for the Tesseract OCR engine. The application also includes support for reading and OCR'ing PDF files.
https://ocr.space/blog/p/free-ocr-windows.html
GNU Affero General Public License v3.0
249 stars 72 forks

Is this same as your online OCR? #2

Open 9tontruck opened 8 years ago

9tontruck commented 8 years ago

Hi,

I haven't been able to get it working on my computer yet, but I looked through your code. It seems to use only Tesseract-OCR's basic function to detect text, namely:

using (var engine = new TesseractEngine(_pathToTestData, Language, EngineMode.Default))
{
    using (var img = Pix.LoadFromFile(pathToImage))
    {
        using (var page = engine.Process(img))
        {
            var resultText = page.GetText();
            if (!String.IsNullOrEmpty(resultText))
                return resultText;
        }
    }
}

I'm familiar with this code, and its results weren't very good. However, your online OCR (https://ocr.space/) works really well. Why are they so different? Do they really use the same code?

My guess is that they use different tessdata. Would you be able to share your tessdata too?

A9T9 commented 8 years ago

You are right, the two projects use different OCR engines. This blog post explains it: http://blog.a9t9.com/2015/09/ocr-api.html