The following code deploys the reference implementation in your AWS account. The solution deploys different components, including an S3 bucket, Step Functions, an Amazon Simple Queue Service (Amazon SQS) queue, and AWS Lambda functions using the AWS Cloud Development Kit (AWS CDK), which is an open-source software development framework to model and provision your cloud application resources using familiar programming languages.
Install AWS CDK:
npm install -g aws-cdk
Download the repo to your local machine:
git clone https://github.com/aws-samples/amazon-textract-a2i-pdf
Go to the folder amazon-textract-a2i-pdf and enter the following:
pip install -r requirements.txt
Bootstrap AWS CDK:
cdk bootstrap
Deploy:
cdk deploy
Create a private team: https://docs.aws.amazon.com/sagemaker/latest/dg/sms-workforce-management.html
Create a human review workflow: https://console.aws.amazon.com/a2i/home?region=us-east-1#/human-review-workflows
Open the file multipagepdfa2i/multipagepdfa2i_stack.py. Update line 23 with the ARN of the human review workflow.
SAGEMAKER_WORKFLOW_AUGMENTED_AI_ARN_EV = ""
cdk destroy
See CONTRIBUTING for more information.
This library is licensed under the MIT-0 License. See the LICENSE file.