Pdf2Dom is a PDF parser that converts documents to an HTML DOM representation. The resulting DOM tree may then be serialized to an HTML file or processed further. A command-line utility for converting PDF documents to HTML is included in the distribution package. Pdf2Dom may also be used as an independent Java library with a standard DOM interface for your DOM-based applications, or as an alternative parser for the CSSBox rendering engine in order to add PDF processing capability to CSSBox. Pdf2Dom is based on the Apache PDFBox™ library.
In the updateStyle function, the check "font.toLowerCase().lastIndexOf(pdFontType[i]) >= 0" gives the wrong result for the font "TimesNewRoman,Bold". The family name itself contains the substring "roman", so pdFontType is detected as "roman" and fontWeight is set to normal instead of bold.
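The mismatch can be reproduced outside Pdf2Dom with a small standalone sketch. This is not the library's actual code: the class name FontWeightSketch, both method names, and the keyword order are assumptions made for illustration. The second method shows one possible workaround, which is to inspect only the style suffix after the last ',' or '-' rather than the whole font name.

```java
import java.util.Locale;

public class FontWeightSketch {

    /**
     * Naive check in the spirit of the reported code: the first keyword found
     * anywhere in the font name decides the weight. The keyword list and its
     * order are hypothetical.
     */
    static String naiveWeight(String fontName) {
        String[] keywords = {"roman", "bold"};
        String lower = fontName.toLowerCase(Locale.ROOT);
        for (String keyword : keywords) {
            if (lower.lastIndexOf(keyword) >= 0) {
                // "roman" is found inside "timesnewroman" before "bold" is tried
                return keyword.equals("bold") ? "bold" : "normal";
            }
        }
        return "normal";
    }

    /**
     * Possible workaround: look only at the style suffix that follows the
     * last ',' or '-' so that the family name cannot trigger a match.
     */
    static String suffixWeight(String fontName) {
        String lower = fontName.toLowerCase(Locale.ROOT);
        int sep = Math.max(lower.lastIndexOf(','), lower.lastIndexOf('-'));
        String style = sep >= 0 ? lower.substring(sep + 1) : "";
        return style.contains("bold") ? "bold" : "normal";
    }

    public static void main(String[] args) {
        System.out.println(naiveWeight("TimesNewRoman,Bold"));  // normal
        System.out.println(suffixWeight("TimesNewRoman,Bold")); // bold
    }
}
```

The suffix approach assumes the style is separated from the family name by ',' or '-', as in "TimesNewRoman,Bold" or "Times-Roman". Names without such a separator would still be reported as normal, so a real fix would likely need an additional fallback.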