aws-samples / amazon-textract-textractor

Analyze documents with Amazon Textract and generate output in multiple formats.
Apache License 2.0
408 stars, 145 forks

Escape html output #387

Closed. Belval closed this 3 months ago.

Belval commented 3 months ago

Issue #, if available: N/A

Description of changes: Escapes the HTML output to fix an issue where words containing &, <, or > would break lxml parsing of that output.
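To illustrate the failure mode, here is a minimal sketch of why unescaped word text breaks parsing and how escaping avoids it. It is not the code from this PR: `render_word`, `parses`, and `round_trip` are hypothetical helpers, the `<span>` wrapper is an assumed output shape, and the standard library's `xml.etree.ElementTree` stands in for lxml's strict parser so the example has no third-party dependencies.

```python
import html
import xml.etree.ElementTree as ET


def render_word(text: str, escape: bool = True) -> str:
    """Wrap a word in a <span> (assumed shape), optionally escaping &, < and >."""
    # html.escape with quote=False replaces only &, < and >.
    body = html.escape(text, quote=False) if escape else text
    return f"<span>{body}</span>"


def parses(markup: str) -> bool:
    """Return True if a strict XML parser accepts the markup."""
    try:
        ET.fromstring(markup)
        return True
    except ET.ParseError:
        return False


def round_trip(text: str) -> str:
    """Escape the word, parse the result, and read the text back."""
    return ET.fromstring(render_word(text)).text


if __name__ == "__main__":
    for word in ["AT&T", "a<b", "x > y"]:
        raw_ok = parses(render_word(word, escape=False))
        escaped_ok = parses(render_word(word))
        print(f"{word!r}: raw parses={raw_ok}, escaped parses={escaped_ok}")
```

Escaping at render time keeps the stored word text untouched, and a parser reading the output recovers the original characters.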

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.