Closed 1339503169 closed 3 weeks ago
here is original pdf error.pdf
image generated by get_pixmap()
what looks like in wps
When I use a file viewer such as WPS to view this file, it is normal, but the images generated by get_pixmap() are very strange, and the results obtained by get_text() are problematic
import fitz original_pdf = "path/to/pdf" doc = fitz.open(original_pdf) page = doc.load_page(0) image = page.get_pixmap()
1.24.5
Windows
3.8
This PDF contains severe errors which prevent any meaningful processing.
Description of the bug
here is original pdf error.pdf
image generated by get_pixmap()
what looks like in wps
When I use a file viewer such as WPS to view this file, it is normal, but the images generated by get_pixmap() are very strange, and the results obtained by get_text() are problematic
How to reproduce the bug
import fitz original_pdf = "path/to/pdf" doc = fitz.open(original_pdf) page = doc.load_page(0) image = page.get_pixmap()
PyMuPDF version
1.24.5
Operating system
Windows
Python version
3.8