bharat-patidar closed this issue 5 years ago
Hi, please look at: https://cloud.google.com/vision/docs/ocr
With pdf2pdfocr, you would have to rewrite the "do_ocr_tesseract" function so that it generates a PDF file containing only the text information from the OCR engine.
Multi-page documents may be affected by the upload procedure.
Is this API free? If so, I can have a look and try to code something. Pull requests are also welcome!
Hi Leo, thank you for the instant response. No, this API is not free. Can you guide me in creating a PDF file from the text output of Google's OCR engine, so that I can use this PDF in the later stages of your code? I have attached a link to a text file containing the Google Vision API result for an image I submitted for OCR: https://www.dropbox.com/s/fxb8ond9d1tli6l/gv_output.txt?dl=0
Hello! I found this piece of code but haven't tested it yet: https://github.com/dinosauria123/gcv2hocr If you can generate the corresponding hOCR file, then we have a chance... :-)
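For anyone who wants to see roughly what "generating the hOCR file" involves, here is a minimal, untested-against-pdf2pdfocr sketch. It assumes the Vision response is the usual JSON with a `textAnnotations` list (first entry is the full text, the rest are words with `boundingPoly.vertices`). The function names `gcv_to_hocr` and `bbox_from_vertices` are made up for this illustration and are not part of pdf2pdfocr or gcv2hocr. It also skips grouping words into lines, which a real converter would probably do.

```python
import html


def bbox_from_vertices(vertices):
    """Return (x0, y0, x1, y1) for a Vision boundingPoly.

    Assumption: a coordinate that equals 0 may be missing from the JSON,
    so missing keys default to 0.
    """
    xs = [v.get("x", 0) for v in vertices]
    ys = [v.get("y", 0) for v in vertices]
    return min(xs), min(ys), max(xs), max(ys)


def gcv_to_hocr(response, page_width, page_height):
    """Build a very small hOCR page from one Vision annotate response.

    `response` is a dict with a `textAnnotations` list. The first entry
    holds the full page text, so only the remaining entries are words.
    """
    words = response.get("textAnnotations", [])[1:]
    spans = []
    for i, word in enumerate(words, start=1):
        x0, y0, x1, y1 = bbox_from_vertices(word["boundingPoly"]["vertices"])
        text = html.escape(word["description"])  # keep the markup valid
        spans.append(
            f"<span class='ocrx_word' id='word_1_{i}' "
            f"title='bbox {x0} {y0} {x1} {y1}'>{text}</span>"
        )
    body = "\n".join(spans)
    return (
        "<!DOCTYPE html>\n<html>\n<head>\n<meta charset='utf-8'/>\n"
        "<meta name='ocr-system' content='gcv-sketch'/>\n</head>\n<body>\n"
        f"<div class='ocr_page' id='page_1' "
        f"title='bbox 0 0 {page_width} {page_height}'>\n"
        f"{body}\n</div>\n</body>\n</html>\n"
    )
```

You would load the saved response with `json.load`, pass the dict plus the image size in pixels to `gcv_to_hocr`, and write the returned string to a `.hocr` file. Whether the rest of the pipeline accepts such a flat word list is something I have not checked.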
Hi, please also look at this: https://github.com/PublicI/pdf-gcv-ocr
Thank you so much, man, for sharing this stuff!! :)
Hello, I want to replace the Tesseract engine with the Google Vision API. Can you please suggest how to do this? Thanks.