marianna13 / doc2dataset

A tool to extract text (and images) from documents (like PDFs)
MIT License
2 stars 1 forks source link