Skip to content

Latest commit

 

History

History

extract_text_from_pdf

Folders and files

NameName
Last commit message
Last commit date

parent directory

..
 
 
 
 

Extract text from pdf

Python can also be used to easily extract text from PDFs using the PyPDF2 package. Getting text from a PDF file proves useful for data mining, invoice reconciliation, or report generation, and the extraction process can be automated in just a few lines of code. You can run pip install PyPDF2 in your terminal to install the package. Below are a few examples of what you can achieve using Py2PDF2:

$ extract_text_from_pdf filename