ArtifexSoftware / pdf2docx

Open source Python library for converting PDF to DOCX.
https://pdf2docx.readthedocs.io
GNU Affero General Public License v3.0
2.46k stars 356 forks source link
docx extract-table pdf-converter pdf-to-word pymupdf

English | 中文

pdf2docx

python-version codecov pypi-version license pypi-downloads

Features

It can also be used as a tool to extract table contents since both table content and format/style is parsed.

Limitations

Documentation

Sample

sample_compare.png