This project levarages OpenAI embeddings and Langchain to train OpenAI's GPT model on your own PDF files. This allows us to ask questions about our own data and allow the large language model to analyse and answer us.
- Train ChatGPT's AI model on your own documents in seconds.
- Extract salient information easily by asking questions.
- Ask for summaries or specific information.
- Python 3.8+
- Pip package installer for Python
- Internet connection for downloading dependencies
-
Clone the repository to your local machine:
git clone https://github.com/chonguschonguschongus/PDF-reader.git
-
Navigate to project directory
cd PDF-reader
-
Install the required dependencies:
pip install python-dotenv streamlit PyPDF2 langchain chromadb
-
Insert your OpenAI API key
OPENAI_API_KEY = "YOUR_KEY_HERE"
Open terminal and run streamlit run pdfreader.py
Click the "Browse files" button to browse locally and select your desired document.
Ask any questions you might have and the chatbot will reply you with information from your document!
- OpenAI for providing the GPT model.
- Python Software Foundation for the Python programming language.