Document Loaders

Note

Combining language models with your own text data is a powerful way to differentiate them. The first step in doing this is to load the data into "documents" - a fancy way of say some pieces of text. This module is aimed at making this easy.

A primary driver of a lot of this is the Unstructured python package. This package is a great way to transform all types of files - text, powerpoint, images, html, pdf, etc - into text data.

For detailed instructions on how to get set up with Unstructured, see installation guidelines here.

The following document loaders are provided:

.. toctree::
   :maxdepth: 1
   :glob:

   ./document_loaders/examples/*

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

document_loaders.rst

document_loaders.rst

Document Loaders

Files

document_loaders.rst

Latest commit

History

document_loaders.rst

File metadata and controls

Document Loaders