I know the question is old, but if you find PDF.js too complicated for the job, try npm install pdfreader. (I wrote this module.)
To extract text from a PDF file, you need 5 lines of code:
    var PdfReader = require("pdfreader").PdfReader;
    new PdfReader().parseFileItems("sample.pdf", function(err, item){
      if (item && item.text)
        console.log(item.text);
    });
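If you would rather get the whole text back as one string instead of logging each item, the callback can be wrapped in a Promise. This is only a minimal sketch: extractText is a hypothetical helper name, not part of pdfreader, and it assumes the callback receives a falsy item once the end of the file is reached.

```javascript
// Hypothetical helper (not part of pdfreader): collects every text item
// into one string. `reader` is any object exposing
// parseFileItems(path, callback), such as new PdfReader().
function extractText(reader, path) {
  return new Promise(function (resolve, reject) {
    var chunks = [];
    reader.parseFileItems(path, function (err, item) {
      if (err) return reject(err);
      // Assumption: a falsy item signals the end of the file.
      if (!item) return resolve(chunks.join("\n"));
      // Items without a text property (page markers etc.) are skipped.
      if (item.text) chunks.push(item.text);
    });
  });
}

// Usage (requires `npm install pdfreader`):
// var PdfReader = require("pdfreader").PdfReader;
// extractText(new PdfReader(), "sample.pdf").then(console.log, console.error);
```

Passing the reader in as an argument keeps the helper independent of the module, so it can be tried with a stub object before pointing it at a real PDF.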