bitextor / pdf-extract

PDF parser and converter to HTML
GNU General Public License v3.0
83 stars 14 forks source link

Why can't bold have tags inside it? #39

Closed kpu closed 4 years ago

kpu commented 4 years ago

What if I make things bold and italic? https://github.com/bitextor/pdf-extract/blob/4ad28a23817851355ba65b6b4699a8f01b2cb760/src/pdfextract/PDFExtract.java#L61

dionwiggins commented 4 years ago

Why would you want bold or italic? It is extracting plain text. Plain text does not have bold or italic.

kpu commented 4 years ago

I don't know why you want to detect bold in a PDF extractor. If you don't need to, remove the code. But you have one right now and it has a bug.