Can't extract fonts, FontDescriptor.FontFile is None

maxpmaxp / pdfreader

Python API for PDF documents

MIT License

113 stars 26 forks source link

@jenskutilek it's not an issue. The attached file doesn't contain any font files inside. It just describes which font to use.

>>> from pdfreader import PDFDocument
>>> fd = open("hello.pdf","rb")
>>> doc = PDFDocument(fd)
>>> page = next(doc.pages())
>>> sorted(page.Resources.Font.keys())
['Tc1']
>>> page.Resources
{'ProcSet': ['PDF', 'Text'], 'ColorSpace': {'Cs1': <IndirectReference:n=5,g=0>, 'Cs2': <IndirectReference:n=6,g=0>}, 'Font': {'Tc1': <IndirectReference:n=7,g=0>}}
>>> font = page.Resources.Font['Tc1']
>>> font.Subtype, font.BaseFont, font.Encoding
('Type1', 'AAAAAB+Produkt-Regular', 'MacRomanEncoding')
>>> font.FontDescriptor.FontFile is None
True

maxpmaxp / pdfreader

Can't extract fonts, FontDescriptor.FontFile is None #111