Anish-M-code / pdftotext

A simple pdftotext conversion tool for Windows 8.1/10/11 and FEDORA/UBUNTU/DEBIAN/ARCH based linux distros using poppler-utils and Google's tesseract-ocr.
MIT License
15 stars 3 forks source link
hacktoberfest hacktoberfest-accepted hacktoberfest2022 ocr ocr-python ocr-recognition ocr-text-reader pdf pdf-documents pdftools pdftotext poppler-utils tesseract-ocr

PDF TO TEXT CONVERTER

A simple Python script to convert PDF Documents to Text Files .

Primary Supported Platforms

Quick Installation

To Install from PyPI:

Run the following commands in Linux terminal / Windows powershell / command prompt to install:-

pip install pdftotext3

Then simply type the following command inside the folder/Directory containing PDF Files to start converting PDF to text :-

pdftotext

For Windows Platform Additional software is required for Proper Functioning of this program , refer Windows Requirements here. To run the program by directly downloading from github refer Instructions here.

NOTE: THIS TOOL IS MEANT TO CONVERT THOSE PDF DOCUMENTS WHICH ARE NOT EASILY CONVERTBLE TO OTHER FORMATS. CURRENTLY THIS TOOL SUPPORTS ENGLISH ONLY.