Closed Devvarat closed 1 month ago
What did you use as margins
parameter?
Did not use any margins
. Can you recommend any specific margins
value that might work?
I am using pymupdf4llm to read .pdfs
Hi JorjMcKie,
Thanks for the reply.
Playing with margins
parameter solved our issue. I am trying to markdown resume pdfs and sometime resumes cover all of the page, so setting margins=0
worked for me.
Thanks
Please consult the documentation: the default margins value is (0, 50, 0, 50)
, so stripes of height 50 are ignored at top and bottom of each page. Use margins=0
in your case.
pymupdf4llm.to_markdown(filepath)
to_markdown() function is missing first and last line on page. Underline pymupdf get_text() function works fine and return the complete page text.