What are you trying to do?

I am using pdfplumber to look for 12 digit strings in a PDF. My code worked when the font was Helvetica, but stopped working when I changed font to stsong-light

What code are you using to do it?

for filepath in glob.iglob(r"C:\Users\thomascrosbie\Desktop\ALL ANALYSIS\ANALYSIS_6*.pdf"): print(filepath) pdf_file = filepath excel_output = set() with pdfplumber.open(pdf_file) as pdf : pages = pdf.pages for i,pg in enumerate(pages): tbl = pages[i].extract_text()

look for account number

        p = re.compile(r"(\d{12})")
        result = p.findall(tbl)
        if(result):
            excel_output.add(result[0])
        else:
            excel_output.add('0')

PDF file

Expected behavior

Excel file with 12 digit string

Actual behavior

12 digit string not detected

Environment

pdfplumber version: 0.5.24
Python version: 3.8.7
OS: Windows 10

jsvine / pdfplumber

Unrecognized Font #335