aws-samples / amazon-textract-textractor

Analyze documents with Amazon Textract and generate output in multiple formats.
Apache License 2.0
360 stars 134 forks source link

Is search_words() broken? #371

Open ttruong-gilead opened 2 weeks ago

ttruong-gilead commented 2 weeks ago

amazon-textract-textractor==1.7.9

document.search_words(keyword="Tom Brady") or page.search_words(keyword="Frank")

doesn't work as expected. Returns a list of random letters or words not even close to keywords. Tried playing with the similarity_threshold to no avail.