PDFNamer

Automatically renames PDFs by extracting their titles with OCR and the OpenAI API. Distributed as a Docker image.

Quick Start

Run the Docker container:

docker run -e OPENAI_API_KEY='sk-xxxxxxxxx' \
           -e INPUT_FOLDER='/pdf_input' \
           -e OUTPUT_FOLDER='/pdf_output' \
           -v /host_directory/pdf_input:/pdf_input \
           -v /host_directory/pdf_output:/pdf_output \
           cvcore/pdfnamer:latest

This command assumes the PDF files are located in /host_directory/pdf_input on the host and are named YYYYMMDD_filename.pdf. The renamed files are saved to /host_directory/pdf_output in the format YYYYMMDD_title.pdf.
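
The naming convention above can be illustrated with a minimal Python sketch. It is not taken from the PDFNamer source: the helper name build_output_name, the DATE_PREFIX pattern, and the rule for replacing unsafe characters with underscores are all assumptions made for illustration, and the actual tool may sanitize titles differently.

```python
import re
from pathlib import Path

# Hypothetical pattern: an 8-digit date, an underscore, any name, then ".pdf".
DATE_PREFIX = re.compile(r"^(\d{8})_.+\.pdf$", re.IGNORECASE)


def build_output_name(input_name: str, title: str) -> str:
    """Return 'YYYYMMDD_title.pdf' for an input named 'YYYYMMDD_filename.pdf'."""
    match = DATE_PREFIX.match(Path(input_name).name)
    if match is None:
        raise ValueError(f"expected YYYYMMDD_filename.pdf, got {input_name!r}")
    # Assumption: collapse anything other than letters, digits, "_" and "-" into "_".
    safe_title = re.sub(r"[^\w\-]+", "_", title.strip()).strip("_")
    if not safe_title:
        raise ValueError("title is empty after sanitizing")
    # Keep the date prefix from the input and swap the rest for the title.
    return f"{match.group(1)}_{safe_title}.pdf"
```

For example, build_output_name("20240131_scan001.pdf", "Annual Report 2023") keeps the date prefix and replaces the rest of the name with the sanitized title.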

Prerequisites

Docker: Ensure Docker is installed.
OpenAI API Key: Obtain from OpenAI.

Environment Variables

OPENAI_API_KEY: Your OpenAI API key.
INPUT_FOLDER: Directory inside the container where PDFs are read from.
OUTPUT_FOLDER: Directory inside the container where renamed PDFs are saved.
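
As a rough sketch of how a script could consume these three variables, the snippet below reads them and fails early if any is missing. The function name load_settings and the fail-fast behavior are assumptions for illustration. The README does not say how PDFNamer itself handles missing values.

```python
import os
from pathlib import Path


def load_settings(env=os.environ) -> dict:
    """Collect the three documented variables, raising if any is unset or empty."""
    names = ("OPENAI_API_KEY", "INPUT_FOLDER", "OUTPUT_FOLDER")
    missing = [name for name in names if not env.get(name)]
    if missing:
        # Assumption: stop immediately rather than fall back to defaults.
        raise RuntimeError("missing environment variables: " + ", ".join(missing))
    return {
        "api_key": env["OPENAI_API_KEY"],
        "input_folder": Path(env["INPUT_FOLDER"]),
        "output_folder": Path(env["OUTPUT_FOLDER"]),
    }
```

Passing a plain dictionary instead of os.environ makes the function easy to try without touching the real environment.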

Development

Install dependencies: pip install -r requirements.txt
Run the script: python your_script.py

License

Licensed under the MIT License.
