fagbokforlaget / pdftotextjs

pdftohtml nodejs wrapper
MIT License
20 stars 15 forks source link

pdftotextjs - pdftotext shell wrapper for Node.js

view on npm npm module downloads Dependency Status Build Status Known Vulnerabilities view on npm

pdftotextjs provides access to pdftotext via shell in nodejs. You'll need pdftotext which comes with poppler-utils.

Installation

via npm:

$ npm install pdftotextjs

Usage

Asynchronous example

const pdftotext = require('pdftotextjs');
const pdf = new pdftotext('test/pdfs/sample.pdf');

// Convert first page only
// These options will be passed to pdftotext
// You may use any valid option
pdf.add_options(['-f 1', '-l 1']);

pdf.getText()
.then(function(result) {
  console.log(result);
})
.catch(function (err) {
  console.error(err);
});

Synchronous example

const pdftotext = require('pdftotextjs');
const pdf = new pdftotext('test/pdfs/sample.pdf');

const data = pdf.getTextSync(); // returns buffer
console.log(data.toString('utf8'));

Tests

$ npm test