10up / classifai

Supercharge WordPress Content Workflows and Engagement with Artificial Intelligence.
https://classifaiplugin.com
GNU General Public License v2.0
578 stars 53 forks source link

Update to v3 API to gain PDF OCR functionality #265

Closed jeffpaul closed 3 years ago

jeffpaul commented 3 years ago

Is your enhancement related to a problem? Please describe. Following on from the work in #111 where we're using the OCR API from Computer Vision v2.1, we should look at updating to the Read API (in either Computer Vision v3.1 or v3.2 that's currently in preview) to gain access to OCR functionality for PDF files. Here's additional details on the Read API, most notably that the free tier would only cover the first two pages of a PDF file.

OCR API currently supports input requirements of:

Read API currently supports input requirements of:

Describe the solution you'd like

Designs

Describe alternatives you've considered

Additional context

jeffpaul commented 3 years ago

Note that the Computer Vision API has officially bumped to v3.2: https://azure.microsoft.com/en-us/updates/cognitive-services-new-computer-vision-api-v32-now-generally-available/

jeffpaul commented 3 years ago

We'll now want to consider how this interacts with the Gutenberg PDF inline support: https://wptavern.com/gutenberg-10-5-embeds-pdfs-adds-verse-block-color-options-and-introduces-new-patterns