An end-to-end document question-answering application using large language model APIs.
Note: This project is not affiliated with or endorsed by ChatPDF. This is an independent project attempting to replicate similar functionality.
ChatPDF-Like is a web application that allows users to upload PDF documents and interact with them using natural language queries. The application leverages large language models (LLMs) like OpenAI's GPT-3.5 Turbo to understand the content of the PDF and provide concise and accurate answers to user questions.
- PDF Document Upload: Upload local PDF files or provide a URL to a PDF document.
- Natural Language Interaction: Ask questions about the content of the PDF in natural language.
- Relevant Answers: Receive concise answers based on the content of the document.
- Source References: View sources (sections of the PDF) that were used to generate the answer.
- Multiple LLM Providers: Support for both OpenAI and Ollama models.
- Web Interface: Simple and intuitive web interface built with Flask and JavaScript.
The application follows these main steps:
- Text Extraction and Processing:
  - The PDF is parsed using `PyPDF2`.
  - Text is extracted from each page, and large pieces of text are split into manageable chunks (see the first sketch after this list).
- Embedding Generation:
  - For each text chunk, an embedding vector is generated using the selected embedding model (e.g., OpenAI's `text-embedding-ada-002`).
  - These embeddings represent the semantic meaning of the text chunks and are stored for similarity calculations.
- User Query Handling:
  - When a user asks a question, an embedding vector for the query is generated using the same embedding model.
- Similarity Search:
  - The application computes the cosine similarity between the query embedding and the text chunk embeddings.
  - The most relevant text chunks are selected based on the highest similarity scores (see the second sketch after this list).
- Prompt Construction:
  - A prompt is created for the language model, incorporating the user's question and the most relevant text chunks.
- Answer Generation:
  - The prompt is sent to the language model (e.g., OpenAI's GPT-3.5 Turbo).
  - The model generates an answer to the user's question based on the provided context (see the final sketch after this list).
- Response Display:
  - The answer is displayed to the user in the web interface.
  - References to the source text chunks are also provided for transparency.
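The core pipeline can be illustrated with a few short sketches. First, extraction and chunking, assuming a recent PyPDF2 (3.x, where `PdfReader` and `extract_text` are available); the chunk size and overlap below are illustrative values, not the project's settings:

```python
from PyPDF2 import PdfReader

def extract_chunks(pdf_path, chunk_size=1000, overlap=100):
    """Pull the text out of every page and split it into overlapping chunks."""
    reader = PdfReader(pdf_path)
    text = "\n".join(page.extract_text() or "" for page in reader.pages)

    chunks = []
    start = 0
    while start < len(text):
        chunks.append(text[start:start + chunk_size])
        start += chunk_size - overlap  # overlap preserves context that straddles a chunk boundary
    return chunks
```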
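Embedding generation, query handling, and similarity search come down to turning each chunk and the query into vectors and ranking the chunks by cosine similarity. A sketch, assuming the v1.x `openai` Python SDK and NumPy; the project itself may pin a different SDK version or use Ollama embeddings instead:

```python
import numpy as np
from openai import OpenAI

client = OpenAI()  # picks up OPENAI_API_KEY from the environment

def embed(texts, model="text-embedding-ada-002"):
    """Return one embedding vector (as a NumPy row) per input text."""
    response = client.embeddings.create(model=model, input=texts)
    return np.array([item.embedding for item in response.data])

def top_chunks(question, chunks, k=3):
    """Rank chunks by cosine similarity to the question and return the k most similar."""
    chunk_vectors = embed(chunks)
    query_vector = embed([question])[0]
    # cosine similarity = dot(a, b) / (||a|| * ||b||)
    scores = chunk_vectors @ query_vector / (
        np.linalg.norm(chunk_vectors, axis=1) * np.linalg.norm(query_vector)
    )
    best = np.argsort(scores)[::-1][:k]
    return [chunks[i] for i in best]
```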
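Finally, prompt construction and answer generation with GPT-3.5 Turbo (again assuming the v1.x `openai` SDK; the prompt wording here is illustrative, not the project's actual template):

```python
from openai import OpenAI

client = OpenAI()

def answer(question, relevant_chunks):
    """Build a context-grounded prompt and ask the chat model to answer from it."""
    context = "\n\n".join(relevant_chunks)
    prompt = (
        "Answer the question using only the context below. "
        "If the context is not sufficient, say so.\n\n"
        f"Context:\n{context}\n\nQuestion: {question}"
    )
    response = client.chat.completions.create(
        model="gpt-3.5-turbo",
        messages=[{"role": "user", "content": prompt}],
    )
    return response.choices[0].message.content
```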
- Python: Version 3.6 or higher is required.
- API Keys:
  - OpenAI API Key: Required to use OpenAI's models for embeddings and answer generation.
  - Ollama API Key: Optional. Required if you want to use Ollama models.
- Clone the Repository: run `git clone https://github.com/Ulov888/chatpdflike.git`, then `cd chatpdflike`.
- Install Dependencies: using `pip`, install the required packages with `pip install -r requirements.txt`.
To use OpenAI's API:

- Sign up for an API key at OpenAI.
- Set the `OPENAI_API_KEY` environment variable: `export OPENAI_API_KEY="your_openai_api_key"`

To use Ollama's API (if desired):

- Obtain an API key from Ollama.
- Set the `OLLAMA_API_KEY` environment variable: `export OLLAMA_API_KEY="your_ollama_api_key"`
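Before starting the app, you can quickly confirm that the variables are visible to Python (this only inspects the environment; it does not validate the keys with either provider):

```python
import os

for name in ("OPENAI_API_KEY", "OLLAMA_API_KEY"):
    print(f"{name}: {'set' if os.environ.get(name) else 'not set'}")
```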
- Start the Application: run the Flask application with `python run.py`. By default, the server runs on `http://0.0.0.0:8080`.
- Access the Web Interface: open a web browser and navigate to `http://localhost:8080`.
- Upload a PDF Document. You can either:
  - Click on "Upload PDF" to select and upload a PDF file from your computer.
  - Enter a URL to a PDF document and click "Submit".
- Interact with the PDF:
  - Once the PDF is processed, you can ask questions about its content using the chat interface on the right side of the screen.
  - Type your question in the input box and press "Send".
- View Answers:
  - The application's response will appear below your question.
  - Source references (e.g., page numbers and excerpts) are provided for context.
The behavior of the language model can be customized by modifying the prompt strategies in `generate_embedding.py`, specifically in the `create_prompt` method of the `Chatbot` class.
Strategies include:
- Paper: For summarizing scientific papers.
- Handbook: For summarizing financial handbooks (answers in Chinese).
- Contract: For understanding contracts (answers in Chinese).
- Default: General-purpose strategy (answers in Chinese).
To select a strategy, modify the `strategy` parameter when calling `create_prompt`.
The application is currently configured to provide answers in Chinese for some strategies. You can modify the prompts to change the language or adjust the behavior of the model.
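The exact templates and method signature live in `generate_embedding.py` and are not reproduced here. Purely as an illustration of the pattern (the dictionary keys, template wording, and function shape below are hypothetical), a strategy-keyed prompt builder might look like this:

```python
# Hypothetical illustration only; the real create_prompt is a method of the
# Chatbot class in generate_embedding.py and may be structured differently.
STRATEGY_TEMPLATES = {
    "paper": "Summarize the excerpts from the scientific paper and answer the question.",
    "handbook": "Answer the question using the financial handbook excerpts.",   # real prompt answers in Chinese
    "contract": "Explain the relevant contract clauses and answer the question.",  # real prompt answers in Chinese
    "default": "Answer the question using the context provided.",              # real prompt answers in Chinese
}

def create_prompt(question, chunks, strategy="default"):
    """Pick the instruction for the chosen strategy and prepend it to the retrieved context."""
    instruction = STRATEGY_TEMPLATES.get(strategy, STRATEGY_TEMPLATES["default"])
    context = "\n\n".join(chunks)
    return f"{instruction}\n\nContext:\n{context}\n\nQuestion: {question}"
```

Changing the language or tone of the answers is then just a matter of editing the template for the corresponding strategy.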
- OpenAI API Costs: Using OpenAI's API will incur costs based on usage. Make sure to monitor your API usage to avoid unexpected charges.
- PDF Parsing: The application uses `PyPDF2`, which may not handle all PDFs perfectly. Complex PDFs with unusual formatting may not parse correctly.
- Embedding Limits: The maximum token limit for embeddings may restrict the size of text chunks or the maximum length of the prompt.
- Model Responses: The quality and accuracy of the answers depend on the performance of the language model and the relevance of the retrieved text chunks.
Contributions are welcome! If you have any suggestions or improvements, feel free to submit an issue or pull request.
This project is licensed under the Apache License.