py-pdf / pdfly

CLI tool to extract (meta)data from PDF and manipulate PDF files
BSD 3-Clause "New" or "Revised" License

Fix test_cat_subset_ensure_reduced_size() #32

Closed by Lucas-C 3 weeks ago

Lucas-C commented 1 year ago

We have identified that pdfly cat, when used to extract pages from a PDF document, produces documents that still contain extra images (stored in the file but not visible).

We already have a unit test with a check that reveals this bug: https://github.com/py-pdf/pdfly/blob/main/tests/test_cat.py#L89
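
For illustration, the kind of check the test name suggests can be sketched as below. This is a minimal sketch, not the actual code in tests/test_cat.py: the helper name is_reduced_size and both path arguments are hypothetical, and it only compares file sizes on disk.

```python
import os


def is_reduced_size(source_path, subset_path):
    """Return True if the extracted subset is strictly smaller than the source.

    Hypothetical helper: source_path is the original PDF, subset_path is the
    file written after extracting only some of its pages.
    """
    # If unused images are still embedded, the subset can end up about as
    # large as the source, and this comparison returns False.
    return os.path.getsize(subset_path) < os.path.getsize(source_path)
```

Comparing sizes is only a cheap signal that unused resources were dropped; it does not inspect which objects remain in the output.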

We should find a way to fix this. This may require fixing an underlying bug in pypdf.

Lucas-C commented 3 weeks ago

This has been fixed by https://github.com/py-pdf/pdfly/pull/45