radkovo / Pdf2Dom

Pdf2Dom is a PDF parser that converts the documents to a HTML DOM representation. The obtained DOM tree may be then serialized to a HTML file or further processed. A command-line utility for converting the PDF documents to HTML is included in the distribution package. Pdf2Dom may be also used as an independent Java library with a standard DOM interface for your DOM-based applications or as an alternative parser for the CSSBox rendering engine in order to add the PDF processing capability to CSSBox. Pdf2Dom is based on the Apache PDFBox™ library.
http://cssbox.sourceforge.net/pdf2dom/
GNU Lesser General Public License v3.0
175 stars 71 forks source link

Handling the exceptions if fontvetter not supporting a TrueTypeFont like Hevertica etc.. #54

Closed vikas-p-r closed 2 years ago

vikas-p-r commented 2 years ago

… need to rethrow that exception as it will clear the entire font info making the method return null..

This can fix most of the spacing issues in unsupported font case..

radkovo commented 2 years ago

Good point, thanks! Merged.