cross-xpdf
compiles xpdf PDF tools for macOS (x64), Windows (x86, x64), Linux (x86, x64).
This is only intended to be used for pdfinfo
and pdftotext
.
pdfinfo
is modified to output to a specified file.
pdftotext
is modified to output a preprocessed JSON that contains rich and structured information about the PDF and the text extracted from it:
Also, a command line switch is added to customize the path to poppler-data
directory.
PDF tools are build inside the Docker container.
macOS 10.11 SDK is needed. Place MacOSX10.11.sdk.tar.xz
to the current directory. Extract it from Xcode 7.3.
git clone https://github.com/zotero/cross-xpdf
cd cross-xpdf
mv path_to_sdk/MacOSX10.11.sdk.tar.xz ./
./build.sh
./build/pdftools.tar.gz
contains the built binaries and poppler-data
directory.