aws-solutions / enhanced-document-understanding-on-aws

Enhanced Document Understanding on AWS delivers an easy-to-use web application that ingests and analyzes documents, extracts content, identifies and redacts sensitive customer information, and creates search indexes from the analyzed data.
https://aws.amazon.com/solutions/implementations/enhanced-document-understanding-on-aws/
Apache License 2.0
29 stars 10 forks source link

Some documents fail entity detection due to repeating words #34

Closed jamesnixon-aws closed 5 months ago

jamesnixon-aws commented 5 months ago

Describe the bug When the line where an entity is present contains some words from the entity just before the actual entity, the entity detection fails.

To Reproduce Upload a document with the above conditions to a case with an entity detection workflow

Expected behavior Detection succeeds, or on failure we do not fail the whole workflow.

Please complete the following information about the solution:

ihmaws commented 5 months ago

Resolved as of v1.0.6