pmaupin / pdfrw

pdfrw is a pure Python library that reads and writes PDFs
Other
1.86k stars 272 forks source link

Tagged PDF #125

Open bigweld86 opened 6 years ago

bigweld86 commented 6 years ago

Hi. I need to create PDFs that are 508 compliant. My current PDF generator supports everything but Tagged PDFs, does pdfrw support Tagged PDFs? I didn't see it in the reference documentation. Thanks

bengjerstad commented 6 years ago

What does it mean to support Tagged PDFs?

I am working on converting PDF docs to 508 compliant docs as well. I am using SortSite to scan my companies website. SortSite passes the test for tagged PDFs when I change the trailer.Root.MarkInfo = PdfObject('<</Marked true>>').

Here is my code: https://github.com/bengjerstad/PYSection508/blob/master/PDFfixMark.py

This code does not change anything in the document except the meta data. After using this method for PDF docs, are the documents 508 compliant for tagging or is this just a loop hole to trick the scanner?

SortSite says that the related standard is Section 508 (2017) A 1.3.1 http://www.w3.org/TR/WCAG20/#content-structure-separation-programmatic

The langue is so vague that I really don't know what it means to say that the document is 508 compliant for this issue.