Unstructured-IO / unstructured

Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
https://www.unstructured.io/
Apache License 2.0
9.25k stars 767 forks source link

refactor: pdfminer image cleanup #3648

Closed christinestraub closed 2 months ago

christinestraub commented 2 months ago

This PR aims to remove clean_pdfminer_duplicate_image_elements() function, as its functionality has already been integrated into the remove_duplicate_elements() function in PR #3630.