Document AI Toolbox is an SDK for Python that provides utility functions for managing, manipulating, and extracting information from the document response. It creates a "wrapped" document object from JSON files in Cloud Storage, local JSON files, or output directly from the Document AI API.
Hello,
This issue follows up on Google Support Case 51622001 (high latency exporting to hOCR), which was redirected here.
Details
The problem occurs with the document wrapper, imported as: from google.cloud.documentai_toolbox import document as documentai_document_wrapper
When transforming tables, the call takes 30 to 50 seconds, depending on the complexity of the page (pages with a large amount of tabular data are the slowest).
I am looking for any possible optimization.
Environment details
google-cloud-documentai-toolbox
version: 0.13.3a0
Steps to reproduce
Code example
Stack trace
N/A. The execution completes correctly, but it takes about 35 seconds.
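For anyone trying to reproduce the numbers, here is a minimal sketch of how the latency could be measured and where the time goes. It is not the code from the support case. It assumes the slow call is the hOCR export (`export_hocr_str`) on a wrapped document loaded with `Document.from_document_path`. The helper names (`time_call`, `profile_call`, `measure_hocr_export`) and the title value are hypothetical.

```python
import cProfile
import io
import pstats
import time
from typing import Any, Callable, Tuple


def time_call(func: Callable[..., Any], *args: Any, **kwargs: Any) -> Tuple[Any, float]:
    """Run func once and return (result, elapsed wall-clock seconds)."""
    start = time.perf_counter()
    result = func(*args, **kwargs)
    return result, time.perf_counter() - start


def profile_call(func: Callable[..., Any], *args: Any, top: int = 15, **kwargs: Any) -> str:
    """Run func under cProfile and return a report sorted by cumulative time."""
    profiler = cProfile.Profile()
    profiler.enable()
    try:
        func(*args, **kwargs)
    finally:
        profiler.disable()
    buffer = io.StringIO()
    pstats.Stats(profiler, stream=buffer).sort_stats("cumulative").print_stats(top)
    return buffer.getvalue()


def measure_hocr_export(document_path: str) -> float:
    """Time the hOCR export of one local Document AI JSON file.

    Assumption: the latency comes from export_hocr_str; swap in the call
    you actually use if it differs.
    """
    # Imported lazily so the helpers above work without the toolbox installed.
    from google.cloud.documentai_toolbox import document as documentai_document_wrapper

    wrapped = documentai_document_wrapper.Document.from_document_path(
        document_path=document_path
    )
    # "latency-test" is a placeholder title, not a value from the support case.
    _, elapsed = time_call(wrapped.export_hocr_str, title="latency-test")
    return elapsed
```

Running `profile_call(wrapped.export_hocr_str, title="latency-test")` on one of the slow pages should show which internal functions dominate the cumulative time, which would help narrow down where an optimization is needed.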
Sources to reproduce the test are attached: sources.zip
Thanks!