aws-samples / amazon-textract-serverless-large-scale-document-processing

Process documents at scale using Amazon Textract
Apache License 2.0
328 stars 165 forks source link

Provided Test PDF in this project doesnt get extracted/analyzed #45

Open pchaganti opened 2 years ago

pchaganti commented 2 years ago

The pdfdoc.pdf in the testdocs folder in this project is a simple 1 page PDF. The document doesnt seem to complete extraction. DynamoDB status is InPROGRESS. No error messages in any of the Cloudwatch logs. Anyone know how to track down what the issue might be?

pchaganti commented 2 years ago

Cannot get any PDF to work. png and jpegs are working fine. The PDF just stays in progress in DynamoDB.

pchaganti commented 2 years ago

Suspect the issue is with the SNS notification of completed textract job which does not seem to be getting sent?

pchaganti commented 2 years ago

Anyone from AWS?