py-pdf / benchmarks

Benchmarking PDF libraries
BSD 3-Clause "New" or "Revised" License
227 stars 11 forks source link

Add pikepdf image extractor #4

Open mara004 opened 2 years ago

mara004 commented 2 years ago

I'd be interested in seeing pikepdf in the image extraction benchmark. It provides some pretty sophisticated code that can, in many cases, extract and save PDF images without needing to decode/re-encode the raw data (i. e. without degrading quality/compression).

https://pikepdf.readthedocs.io/en/latest/topics/images.html https://github.com/pikepdf/pikepdf/blob/master/src/pikepdf/models/image.py