rkusa / pdfjs

A Portable Document Format (PDF) generation library targeting both the server- and client-side.
MIT License

Incorrect Text Extraction #279

Closed: VrajGohil closed this issue 2 years ago

VrajGohil commented 2 years ago

Incorrect text is extracted from the PDF.

Example code: https://jsfiddle.net/1zurgbsn/3/

Example PDF: https://certotest1.s3.ap-south-1.amazonaws.com/pdfs/35b44d3f4795477c94de594b9d46c1cd.pdf

Upload the PDF in the given example. It fails to extract the word "Certificate"; it extracts "Certicate" instead.
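
To narrow down which characters go missing, a small diff helper can be useful. The sketch below is only an illustration: `missingSegment` is a hypothetical helper, not part of pdfjs or pdf.js, and it assumes the extracted string differs from the expected one by a single dropped run of characters. For the two strings quoted above it reports "fi". That would be consistent with an "fi" ligature glyph being dropped during extraction, though nothing in this thread confirms that as the cause.

```javascript
// Hypothetical diagnostic helper (not an API of pdfjs or pdf.js).
// Assumes `extracted` equals `expected` with one contiguous run removed.
function missingSegment(expected, extracted) {
  // Length of the common prefix.
  let prefix = 0;
  while (
    prefix < extracted.length &&
    expected[prefix] === extracted[prefix]
  ) {
    prefix++;
  }

  // Length of the common suffix, without overlapping the prefix.
  let suffix = 0;
  while (
    suffix < extracted.length - prefix &&
    expected[expected.length - 1 - suffix] ===
      extracted[extracted.length - 1 - suffix]
  ) {
    suffix++;
  }

  // Whatever lies between prefix and suffix in `expected` was dropped.
  return expected.slice(prefix, expected.length - suffix);
}

// Strings taken from the report above.
console.log(missingSegment("Certificate", "Certicate"));
```

If the dropped run turns out to be a ligature pair such as "fi" or "fl", that detail is worth including when reporting the problem to the extraction library.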

rkusa commented 2 years ago

You probably meant to report this over at https://github.com/mozilla/pdf.js/