jsvine / pdfplumber

Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.
MIT License
6.57k stars 659 forks source link

I can't extract mathematical expression texts. #329

Closed ai-motive closed 3 years ago

ai-motive commented 3 years ago

I wanna extract mathematical expression texts in pdf.

Attached pdf : 8_11074_XnE0n_4_p.pdf

preview image ex)

스크린샷, 2020-12-31 16-46-42

Looking at the second line, I want to get the following latex text output.

① -2^2 = 4 ② (-1)^101 = -101 ... ④ (-\frac{1}{2})^2 = -\frac{1}{4}

can I take some helps ?

jsvine commented 3 years ago

Hi @ai-motive, and thank you for your interest in this library. It does not currently provide any special support for mathematical expressions or LaTeX output. Closing this issue for now, but if someone in the community would like to try adding this support, feel free to discuss it here.