pdf-association / pdf-issues

Industry-based resolutions for issues and errata reported against any PDF-related specification
https://pdf-issues.pdfa.org/
64 stars 2 forks source link

Organizational problems in 7.9.2 String object types #263

Open johnwhitington opened 1 year ago

johnwhitington commented 1 year ago

I am working from a copy of the ISO document purchased in Dec 2021.

7.9.2 contains a helpful diagram, Figure 7, which organizes strings types in a tree structure like so:

string types -> text string -> -> PDFDocEncodedString -> -> UTF-16BE encoded string... -> -> UTF-8 encoded string... -> ASCII string -> byte string

But the text itself, is organized in sections like this:

7.9.2.2 Text String Type 7.9.2.3 PDFDocEncoded Type 7.9.2.4 Byte string type

i.e "PDFDocEncoded" is not within "Text String" like it is in Figure 7.

I Suggest reworking 7.9.2 so its sections and subsections match the structure of Figure 7. As well as moving things around, this would involve adding a single-sentence section, even if it is obvious, on what "ASCII string" means.

petervwyatt commented 1 year ago

While I 110% agree with the intent of this suggestion as it will make things more logical and easier to understand, for errata I would rather not do this now as changing heading numbering will get confusing for other errata (as to whether the heading number referenced is the old or new heading).

So suggest to park this suggestion until the next major edition of the core PDF specification when it can be properly done.