ad-freiburg / pdftotext-plus-plus

A fast and accurate command line tool for extracting text from PDF files.
https://pdftotext.cs.uni-freiburg.de
Apache License 2.0
15 stars 0 forks source link

Implement more output formats (TSV, JSON, etc.) and add E2E tests for each. #27

Open ckorzen opened 1 year ago

ckorzen commented 1 year ago

Text++ Control characters, semantic role