ocrmypdf

Here are 25 public repositories matching this topic...

lucasrla / remarks

Extract annotations (highlights and scribbles) from PDF, EPUB, and notebooks marked with reMarkable tablets. Export to Markdown, PDF, PNG, SVG

markdown pdf ocr highlighting annotations pdf-converter epub zotero obsidian ocrmypdf svg-images pymupdf remarkable-tablet roamresearch

Updated May 26, 2024
Python

CypherousSkies / reading-for-listeners

A deep-learning-powered accessibility application that turns PDFs into audio files. Featuring OCR improvement and TTS with inflection!

python nlp pdf ocr deep-learning transformers tts ocrmypdf bert mozilla-tts

Updated Jan 14, 2023
Python

soham-1 / fastapi_pdfextractor

An API built with FastAPI for extracting the text content of PDFs using pdfminer. It also supports scanned images in PDFs by using Tesseract and ocrmypdf.

tesseract ocrmypdf pdfminer fastapi

Updated Jun 18, 2021
Python

bernmic / ocrmypdf-watchdog

A watchdog for OCRmyPDF written in Go

go docker golang docker-compose ocrmypdf

Updated Feb 12, 2022
Go

DecaTec / OCRmyFiles

Moved to codeberg.org - https://codeberg.org/DecaTec/OCRmyFiles - Bash script for adding a text layer to PDF files and converting images in PDFs (with OCR).

bash pdf ocr images bash-script ocrmypdf

Updated Feb 13, 2022
Shell

prateekralhan / Scanned-PDFs-checker

A Streamlit-based web app that detects scanned/digital PDFs in a large corpus and lets the user OCR the scanned documents

ghostscript python3 pdf-document ocrmypdf opensourceforgood pytesseract streamlit

Updated Aug 11, 2022
Python

Achiwilms / OCR-Wizard

A powerful and user-friendly tool based on OCRmyPDF, offering a seamless GUI for conversion of image-based PDFs into searchable text.

python pdf ocrmypdf ocr-recognition pdf-ocr-extraction ocr-python searchable-pdf ocr-pdf pdf-ocr

Updated Oct 28, 2023
Python

hansmi / baamhackl

Execute command when files are moved to a directory.

golang ocr scanner watchman inotify ocrmypdf

Updated Dec 1, 2024
Go

procesaur / TExASe

Flask application for OCR and extraction of text from documents with support for repository applications

api flask ocr repository tika text-extraction tesseract-ocr ocrmypdf

Updated Sep 7, 2023
Python

DenBeke / ocrmymail

OCRmyMail is an SMTP server relay that adds an OCR text layer to PDF mail attachments and sends them to the original recipient.

docker golang pdf mail ocr server smtp ocrmypdf

Updated May 14, 2021
Go

lakshay1296 / ocrmyPDF_Windows

ocrmyPDF_Windows is inspired by jbarlow83's OCRmyPDF: https://github.com/jbarlow83/OCRmyPDF

python windows flask ocr tesseract windows-10 python3 tesseract-ocr ocrmypdf ocr-recognition tesseract-engine ocr-python tesseract-4

Updated Jan 12, 2020
Python

TheComputeGuy / PDFOCRtool

Add an OCR layer to *any* PDF

python pdf ocr tesseract ocrmypdf pdftopng

Updated Sep 1, 2021
Python

pddd / GUI4OCRMyPDF

SwiftUI GUI for ocrmypdf

swift ocr ocrmypdf swiftui

Updated Jan 26, 2023
Swift

Rajasekaran85 / Python-TIFF-to-OCR-PDF

Convert TIFF images to OCR PDF

pdf ghostscript ocr glob pdf-converter tesseract-ocr ocrmypdf pypdf2

Updated Mar 9, 2024
Python

Guizaords / Conversor_pdf_para_ocr

Python program intended to create an OCR PDF from a given OCR list. Based on ocrmypdf

pdf-converter python3 ocrmypdf

Updated Jun 22, 2022
Python

HappyBravo / PDF_to_Text

An attempt to make OCR

javascript python html ocr tesseract tesseract-ocr ocrmypdf pdftotext ocr-python

Updated Jun 24, 2023
HTML

lakshay1296 / ocrmypdf-flask-example

A simple implementation of ocrmypdf and Tesseract with Flask, intended to be hosted on a server as an API. The code was written on CentOS 7 and works on Linux only, because the ocrmypdf library does not support Windows owing to a missing Leptonica DLL. For Windows, consider https://github.com/lakshay1296/OCR_Conversion_JPEG2PDF. This is image to ocr pdf…

Updated Dec 26, 2019
Python

brlin-tw / learning-ocrmypdf

Notes taken while learning OCRmyPDF, a Tesseract-based Optical Character Recognition (OCR) tool

linux notes optical-character-recognition ocrmypdf

Updated Apr 3, 2020

miguelpduarte / uporto-menu-ocr-test-dockerized

A dockerized OCR test for parsing UPorto's canteens'/restaurants' menus

pdf ocr parsing ocrmypdf up uporto tessearct

Updated Aug 18, 2019
Shell

sebfischer83 / OCRmyPDF-Server

A small server API for the OCRmyPDF project.

ocrmypdf

Updated Nov 11, 2024
C#
