Azure / azure-sdk-for-python

This repository is for active development of the Azure SDK for Python. For consumers of the SDK we recommend visiting our public developer docs at https://learn.microsoft.com/python/azure/ or our versioned developer docs at https://azure.github.io/azure-sdk-for-python.
MIT License

Get Searchable PDF from Computer Vision READ API #19085

Closed mananshah1403 closed 2 years ago

mananshah1403 commented 3 years ago

Is your feature request related to a problem? Please describe. I have a PDF document that I can send to the READ API to get OCR results, but I am trying to find a way to overlay that text on the PDF so that it becomes searchable.

Describe the solution you'd like READ API outputs a searchable pdf along with the results

Describe alternatives you've considered A 3rd party library that will allow drawing invisible text on pdf
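As a rough illustration of the "invisible text" alternative: the PDF format has a text rendering mode 3 (neither fill nor stroke), which is the usual way searchable text is hidden on top of a scanned page. The sketch below only builds the content-stream operators for one OCR line; it does not write a PDF file, and a third-party PDF library would still be needed to append the operators to the page. The function name `invisible_text_ops`, the font resource name `F1`, and the input shape (an 8-number bounding box in inches with a top-left origin, which is how Read results for PDFs are assumed to look here) are illustrative assumptions, not part of the SDK.

```python
from typing import Sequence

POINTS_PER_INCH = 72.0  # default PDF user-space unit is 1/72 inch


def _escape_pdf_string(text: str) -> str:
    """Escape backslashes and parentheses for a PDF literal string."""
    return text.replace("\\", "\\\\").replace("(", "\\(").replace(")", "\\)")


def invisible_text_ops(
    text: str,
    bounding_box: Sequence[float],
    page_height_in: float,
    font_name: str = "F1",  # hypothetical font resource name on the page
) -> str:
    """Build PDF operators that place `text` invisibly over an OCR line.

    Assumes `bounding_box` is [x1, y1, x2, y2, x3, y3, x4, y4] in inches,
    measured from the top-left corner of the page.
    """
    xs = bounding_box[0::2]
    ys = bounding_box[1::2]
    left_in = min(xs)
    bottom_in = max(ys)  # largest y is the lowest edge with a top-left origin
    height_in = max(ys) - min(ys)

    # PDF user space has its origin at the bottom-left, so flip the y axis.
    x_pt = left_in * POINTS_PER_INCH
    y_pt = (page_height_in - bottom_in) * POINTS_PER_INCH
    font_size = max(height_in * POINTS_PER_INCH, 1.0)

    escaped = _escape_pdf_string(text)
    # "3 Tr" selects text rendering mode 3: text is present but not painted.
    return (
        f"BT 3 Tr /{font_name} {font_size:.2f} Tf "
        f"{x_pt:.2f} {y_pt:.2f} Td ({escaped}) Tj ET"
    )
```

This ignores rotated lines, horizontal scaling to match the exact line width, and text outside the encoding of the chosen font, all of which a real overlay would have to handle.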

annatisch commented 3 years ago

Thanks for your question, @mananshah1403. To make sure I route your request to the right team for advice, could you give me a few more details about the service endpoint that you've been working with? It sounds like you've been working directly with the REST API rather than one of the SDKs. Is that the case? Would you be able to link me to the API or SDK you've been working with so far?

mananshah1403 commented 3 years ago

Thanks for the response @annatisch. I am working with the python package for computer vision. Here is the link https://pypi.org/project/azure-cognitiveservices-vision-computervision/. Let me know if you need any further information.

annatisch commented 3 years ago

Thanks @mananshah1403! I think the service team would hopefully be able to provide some details - adding @TFR258 and @ryogok - can you give us your thoughts on this scenario?

ghost commented 3 years ago

Thanks for the feedback! We are routing this to the appropriate team for follow-up. cc @ryogok, @TFR258, @tburns10.

Issue Details
Author: mananshah1403
Assignees: -
Labels: `Cognitive - Computer Vision`, `Service Attention`, `customer-reported`, `question`
Milestone: -

SaurabhSharma-MSFT commented 2 years ago

@TFR258 and @ryogok, can you please help with this issue?

navba-MSFT commented 2 years ago

@mananshah1403 The Computer Vision Read API is Azure's latest OCR technology. It extracts printed text (in several languages), handwritten text (in several languages), digits, and currency symbols from images and multi-page PDF documents. It's optimized to extract text from text-heavy images and multi-page PDF documents with mixed languages.

Requirement: For PDF and TIFF files, up to 2000 pages (only first two pages for the free tier) are processed. The file size must be less than 50 MB (6 MB for the free tier) and dimensions at least 50 x 50 pixels and at most 10000 x 10000 pixels.
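For anyone who wants to check a file against these limits before submitting it, here is a minimal pre-flight sketch. The function name `check_read_limits` and its parameters are hypothetical and not part of the SDK; the thresholds are copied from the requirement above, and treating 1 MB as 1024 * 1024 bytes is an assumption.

```python
from typing import List, Optional

MB = 1024 * 1024  # assumption: the service limit may be defined differently


def check_read_limits(
    size_bytes: int,
    page_count: int,
    free_tier: bool = False,
    width_px: Optional[int] = None,
    height_px: Optional[int] = None,
) -> List[str]:
    """Return human-readable problems; an empty list means no limit was hit."""
    problems: List[str] = []

    # File size must be less than 50 MB (6 MB on the free tier).
    max_mb = 6 if free_tier else 50
    if size_bytes >= max_mb * MB:
        problems.append(f"file size must be less than {max_mb} MB")

    # Only the first 2000 pages (2 on the free tier) are processed.
    max_pages = 2 if free_tier else 2000
    if page_count > max_pages:
        problems.append(f"only the first {max_pages} pages will be processed")

    # Dimensions must be between 50 x 50 and 10000 x 10000 pixels.
    if width_px is not None and height_px is not None:
        if min(width_px, height_px) < 50 or max(width_px, height_px) > 10000:
            problems.append("dimensions must be between 50 x 50 and 10000 x 10000 pixels")

    return problems
```

For example, `check_read_limits(10 * MB, 5, free_tier=True)` reports both the size and the page-count problem, while the same file on a paid tier returns an empty list.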

Suggestion: Could you refer to this REST API and check whether it meets your requirement? Awaiting your reply.

ghost commented 2 years ago

Hi, we're sending this friendly reminder because we haven't heard back from you in a while. We need more information about this issue to help address it. Please be sure to give us your input within the next 7 days. If we don't hear back from you within 14 days of this comment the issue will be automatically closed. Thank you!