sambitdash / PDFIO.jl

PDF Reader Library for Native Julia.

Incorrect Doc Info #114

Closed by wardjm 4 months ago

wardjm commented 4 months ago

With the attached file as input, pdDocGetInfo returns garbage Doc Info.

    info = pdDocGetInfo(doc)
    Dict{String, Union{CDDate, String, CosObject}} with 4 entries:
      "Producer" => "\u9fP\x13ñé(ä\x05'0qùˆj-ŁuSþöFüÖNÙ\x17\u9f\n<\u009f6…¦µÙ'"
      "Author" => "‰5Î"
      "Creator" => "‘F\0êÿ;ÿ\x05'\x06X”E\0\x02Ã*0…Ž&‹õs‰\aùC\x10¢y§†ÔÃ6\x01¬ÁıGG»9"
      "Title" => "š[\0îÿ,â\x05FE\x17è\x01o0ž\0낢üóNÅX\uad\0&™:†¡"

Using exiftool:

    $ exiftool 2GFKZ3FSJWFKVKALUPEYHSLSHEJ3OGA6.pdf
    ExifTool Version Number         : 12.40
    File Name                       : 2GFKZ3FSJWFKVKALUPEYHSLSHEJ3OGA6.pdf
    Directory                       : .
    File Size                       : 5.5 KiB
    File Modification Date/Time     : 2018:12:04 17:06:47-05:00
    File Access Date/Time           : 2024:04:15 15:48:57-04:00
    File Inode Change Date/Time     : 2024:04:15 15:33:23-04:00
    File Permissions                : -rw-rw-r--
    File Type                       : PDF
    File Type Extension             : pdf
    MIME Type                       : application/pdf
    PDF Version                     : 1.4
    Linearized                      : Yes
    Encryption                      : Standard V2.3 (128-bit)
    User Access                     : Print, Fill forms, Extract, Print high-res
    Create Date                     : 2005:03:24 11:02:35Z
    Modify Date                     : 2005:03:24 11:41:42-05:00
    Page Count                      : 1
    Creation Date                   : 2005:03:24 11:02:35Z
    Author                          : USTP
    Producer                        : Acrobat PDFWriter 5.0 for Windows NT
    Mod Date                        : 2005:03:24 11:41:42-05:00
    Metadata Date                   : 2005:03:24 11:41:42-05:00
    Title                           : Chapter 11 Filings In Connecticut
    Creator                         : USTP

Could it be due to the encryption? I'm going to try to track down the issue and open a PR, but I'm posting here for later reference.

Attached file: 2GFKZ3FSJWFKVKALUPEYHSLSHEJ3OGA6.pdf

wardjm commented 4 months ago

The Julia package manager was using v0.1.1. This is fixed in v0.1.14.
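
For anyone landing here with the same symptom, a minimal sketch of checking which PDFIO version an environment resolved to and requesting the fixed release. It uses only the standard Pkg API. That Pkg can resolve v0.1.14 in your environment is an assumption: another dependency's compat bounds may hold PDFIO back, in which case Pkg reports the conflict instead of upgrading.

```julia
using Pkg

# Show which PDFIO version the active environment currently uses.
Pkg.status("PDFIO")

# Request the release named in this thread as containing the fix.
# Assumption: nothing else in the environment pins PDFIO to an older version.
Pkg.add(name="PDFIO", version="0.1.14")

# Confirm the version after the change.
Pkg.status("PDFIO")
```

After upgrading, restart the Julia session before calling pdDocGetInfo again so the new version is the one that gets loaded.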