-
The attached input document contains text, then a table, followed by more text. We want the text file to match the input PDF file.
![input_page](https://github.com/user-attachments/assets/fe…
-
The codebase has this line: https://github.com/aws-samples/amazon-textract-textractor/blob/28d6110b08a3584edc4c87022a41d12961b88688/textractor/entities/table.py#L640
to retrieve the cell content. …
-
I recently added a new package to my project with `uv add packagename`, which resulted in the yanked dependency warning. How should I handle this scenario? Is there a command that "re-resolve the p…
-
I noticed that even when testing extreme values of `heuristic_line_break_threshold`, `heuristic_overlap_ratio`, and `heuristic_h_tolerance`, there was no change in the output. This led me to examine their us…
-
Hi,
Per [S3 docs](https://docs.aws.amazon.com/AmazonS3/latest/dev/notification-content-structure.html):
> The s3 key provides information about the bucket and object involved in the event. The …
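
The quoted passage is cut off, so the following is only a sketch under an assumption: that the concern is the URL-encoding of object keys in S3 event notifications, which the linked S3 docs describe. The helper name `object_location` and the sample record are hypothetical and not part of any library; only `urllib.parse.unquote_plus` is a real standard-library call.

```python
from typing import Tuple
from urllib.parse import unquote_plus


def object_location(record: dict) -> Tuple[str, str]:
    """Return (bucket, decoded key) from one S3 event notification record.

    Hypothetical helper for illustration only. S3 URL-encodes the object
    key in the notification (spaces arrive as '+'), so the key is decoded
    before it is used in any follow-up API call.
    """
    s3 = record["s3"]
    bucket = s3["bucket"]["name"]
    key = unquote_plus(s3["object"]["key"])
    return bucket, key


# Made-up record containing only the fields the helper reads.
sample_record = {
    "s3": {
        "bucket": {"name": "example-bucket"},
        "object": {"key": "reports/My+File%281%29.pdf"},
    }
}

bucket, key = object_location(sample_record)
print(bucket, key)
```

Decoding in one place keeps the rest of the pipeline working with the real key, so a file name containing spaces or parentheses is not looked up under its encoded form.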
-
Hello, I am having an issue with merged columns, and I realize the example in the documentation also suffers from this. If you look at the example of the "Consolidated Statement of Cash Flows" @ https:…
-
Are there plans to fix this issue:
https://github.com/aws-samples/amazon-textract-response-parser/blob/cd24097feb4349d35822ff039dd0f40a6ad8b48c/src-python/trp/trp2.py#L595
I've noticed that the …
-
In the documentation, this example: https://aws-samples.github.io/amazon-textract-textractor/notebooks/visualizing_results.html#Visualizing-the-result-of-a-search does not generate the right output.
…
-
https://shannonhung.github.io/posts/textract-multi-column/
Writing things down so I don't forget.
-
The documentation for `start_document_analysis` says it supports a list of PIL images, but in the source code https://github.com/aws-samples/amazon-textract-textractor/blob/e40f5b0378f9ee24d0a757de414505…