python-docx2text

A pure-Python utility for extracting text from docx files. The code is taken and adapted from python-docx. However, it can also extract text from headers, footers, and hyperlinks.

How to run?

    python py_docx2text.py file.docx
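
For readers curious how text in headers, footers, and hyperlinks can be reached, here is a minimal sketch using only the standard library. It is not the code in py_docx2text.py; the names `xml_to_text`, `extract_text`, and `PART_PATTERN` are made up for illustration. It assumes the usual docx layout, where the body lives in `word/document.xml` and headers and footers live in parts such as `word/header1.xml` and `word/footer1.xml`. Hyperlink text is picked up because its runs sit inside the paragraph element. Paragraphs nested inside other paragraphs (for example, text boxes) may be emitted twice by this simple approach.

```python
import re
import zipfile
import xml.etree.ElementTree as ET

# WordprocessingML namespace used by document, header and footer parts.
W_NS = "{http://schemas.openxmlformats.org/wordprocessingml/2006/main}"

# Parts inside the .docx zip archive that carry visible text (assumed layout).
PART_PATTERN = re.compile(r"word/(document|header\d*|footer\d*)\.xml$")


def xml_to_text(xml_bytes):
    """Return the text of one part, one line per non-empty paragraph."""
    root = ET.fromstring(xml_bytes)
    paragraphs = []
    for para in root.iter(W_NS + "p"):
        pieces = []
        # iter() walks all descendants, so runs inside w:hyperlink are included.
        for node in para.iter():
            if node.tag == W_NS + "t" and node.text:
                pieces.append(node.text)
            elif node.tag == W_NS + "tab":
                pieces.append("\t")
            elif node.tag in (W_NS + "br", W_NS + "cr"):
                pieces.append("\n")
        if pieces:
            paragraphs.append("".join(pieces))
    return "\n".join(paragraphs)


def extract_text(docx_path):
    """Collect text from the body, headers and footers of a docx file."""
    with zipfile.ZipFile(docx_path) as archive:
        names = sorted(n for n in archive.namelist() if PART_PATTERN.match(n))
        texts = [xml_to_text(archive.read(n)) for n in names]
    # Separate parts with a blank line and skip parts without text.
    return "\n\n".join(t for t in texts if t)
```

Under these assumptions, `print(extract_text("file.docx"))` would print the body text first, followed by footer and header text, because the part names are sorted alphabetically.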