Extract annotations (highlights and scribbles) from PDF, EPUB, and notebooks marked with reMarkable tablets. Export to Markdown, PDF, PNG, SVG
-
Updated
May 26, 2024 - Python
Extract annotations (highlights and scribbles) from PDF, EPUB, and notebooks marked with reMarkable tablets. Export to Markdown, PDF, PNG, SVG
A deep-learning powered accessibility application which turns pdfs into audio files. Featuring ocr improvement and tts with inflection!
A watchdog for OCRMyPDF written in go
Moved to codeberg.org - https://codeberg.org/DecaTec/OCRmyFiles - Bash script for adding a text layer to PDF files and converting images in PDFs (with OCR).
A streamlit based webapp to detect scanned/digital PDFs from a large corpus as well as allow the user to OCR the scanned docs
A powerful and user-friendly tool based on OCRmyPDF, offering a seamless GUI for conversion of image-based PDFs into searchable text.
Flask application for OCR and extraction of text from documents with support for repository applications
ocrmyPDF_Windows is inspired by jbarlow83's ocormtpdf. https://github.com/jbarlow83/OCRmyPDF.
TIFF Image Convert to OCR PDF
An attempt to make OCR
Programa em Python destinado a criar um pdf ocr de uma lista de ocr dada. Baseado no ocrmypdf
A simple implementation of ocrmypdf and tesseract with flask for hosting to a server as an API. The code was written on CentOS7. This code works on linux only as ocrmypdf library does not have support on windows because of missing leptonica dll. For windows consider https://github.com/lakshay1296/OCR_Conversion_JPEG2PDF. This is image to ocr pdf…
Notes during the learning of OCRmyPDF, a Tesseract based Optical Character Recognition(OCR) software
Add a description, image, and links to the ocrmypdf topic page so that developers can more easily learn about it.
To associate your repository with the ocrmypdf topic, visit your repo's landing page and select "manage topics."