plutext / docx4j-ImportXHTML

Converts XHTML to OpenXML WordML (docx) using docx4j
136 stars 126 forks source link

docx4j-ImportXHTML

Converts XHTML to OpenXML WordML (docx) using docx4j; there is also some support for converting to pptx.

Maven Central

docx4j is licensed under ASLv2.

This project is licensed under LGPL v2.1 (or later), which is the license used by openhtmltopdf (the main dependency).
See legals/NOTICE for details.

docx4j for JAXB 3.0 and Java 11+

docx4j-ImportXHTML v11.4.6 uses Jakarta XML Binding API 3.0, as opposed to JAXB 2.x used in earlier versions (which import javax.xml.bind.*). Since this release uses jakarta.xml.bind, rather than javax.xml.bind, if you have existing code which imports javax.xml.bind, you'll need to search/replace across your code base, replacing javax.xml.bind with jakarta.xml.bind. You'll also need to replace your JAXB jars (which Maven will do for you automatically; otherwise get them from the relevant zip file).

Being a JPMS modularised release, the jars also contain module-info.class entries.

To use it, add Maven Central

plus the dep corresponding to the JAXB implementation you wish to use

You should use one and only one of docx4j-JAXB-*

docx4j-ImportXHTML v8

This is docx4j-ImportXHTML for Java 8.

How do I build docx4j?

Get it from GitHub, at https://github.com/plutext/docx4j

mvn clean
mvn install