py-pdf / pypdf

A pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files
https://pypdf.readthedocs.io/en/latest/
Other
7.73k stars 1.36k forks source link

DOC: Add feature support by pypdf #2738

Open j-t-1 opened 3 days ago

codecov[bot] commented 3 days ago

Codecov Report

All modified and coverable lines are covered by tests :white_check_mark:

Project coverage is 95.14%. Comparing base (de07cbc) to head (33e8a90).

Additional details and impacted files ```diff @@ Coverage Diff @@ ## main #2738 +/- ## ======================================= Coverage 95.14% 95.14% ======================================= Files 51 51 Lines 8548 8548 Branches 1703 1703 ======================================= Hits 8133 8133 Misses 261 261 Partials 154 154 ```

:umbrella: View full report in Codecov by Sentry.
:loudspeaker: Have feedback on the report? Share it here.

stefan6419846 commented 3 days ago

What is the data based upon?

j-t-1 commented 3 days ago

Content stream compression is provided by function compress_content_streams. Cross-reference streams, do we support this? I will revert the tick.

stefan6419846 commented 3 days ago

I will wait for @pubpub-zz to check if these statements are valid, having more knowledge on PDF files and specific stuff of pypdf.

pubpub-zz commented 2 days ago

Cross Reference streams are defined in §3.4.7of pdf 1.7 reference. They are tagged with XRefStm

It allows many objects to be stored within a stream which may improve compression.

pypdf can read them without problem. I can not remember any functions allowing to modify/create some.

j-t-1 commented 13 hours ago

Thanks @pubpub-zz. Content Stream Compression support is now the added supported feature.