torakiki / sambox

A PDFBox fork intended to be used as PDF processor for Sejda and PDFsam
Apache License 2.0

SAMBox PDF processor


An Apache PDFBox fork intended to be used as a PDF processor for Sejda and PDFsam related projects.

What's different from PDFBox?

Are PDFBox commits merged to SAMBox?

SAMBox is a fork of a SNAPSHOT of PDFBox 2.0.0 and we try to keep it aligned with it. We have made massive changes to the original codebase, and so have the PDFBox developers since the fork, so merging changes back from the PDFBox trunk is sometimes challenging. We do our best.

How do I load a document?

Here is a snippet to load a document from a file and write it back to a newFile.

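    // file and newFile are assumed here to be java.io.File instances supplied by the caller
    try (PDDocument document = PDFParser.parse(SeekableSources.seekableSourceFrom(file))) {
        // XREF_STREAM asks the writer to emit the cross-reference data as a stream
        document.writeTo(newFile, WriteOption.XREF_STREAM);
    }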

Tuning

Some system properties are available to modify SAMBox's default behaviour. Take a look at org.sejda.io.SeekableSources and org.sejda.sambox.SAMBox to find out which are currently available.
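
As a rough sketch of the general mechanism, system properties can be set on the command line with -D or programmatically before the library code that reads them runs. The property name below, example.sambox.buffer.size, is hypothetical and used only for illustration, as are the TuningSketch class and its readIntProperty helper. The real property names are the ones defined in the two classes mentioned above.

```java
class TuningSketch {

    // Hypothetical property name for illustration only; NOT a real SAMBox property.
    static final String BUFFER_SIZE_PROPERTY = "example.sambox.buffer.size";

    // Reads an integer system property, falling back when it is unset or not a number.
    static int readIntProperty(String name, int fallback) {
        return Integer.getInteger(name, fallback);
    }

    public static void main(String[] args) {
        // Equivalent to launching the JVM with -Dexample.sambox.buffer.size=8192
        System.setProperty(BUFFER_SIZE_PROPERTY, "8192");
        System.out.println(readIntProperty(BUFFER_SIZE_PROPERTY, 4096));
    }
}
```

Libraries often read such values once, so it is probably safest to set them at JVM startup rather than after documents have already been loaded.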