Skip to content
View sofiawu's full-sized avatar

Block or report sofiawu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Document Artifical Intelligence

139 6 Updated Dec 8, 2024

OpenOCR: A general OCR system with accuracy and efficiency. Supporting 24 Scene Text Recognition methods trained from scratch on large-scale real datasets, and will continue to add the latest methods.

Python 448 37 Updated Jan 3, 2025

主要记录大语言大模型(LLMs) 算法(应用)工程师相关的知识及面试题

HTML 4,837 558 Updated Oct 22, 2024

A comprehensive collection of IQA papers

TeX 1,084 73 Updated Jan 5, 2025
12 1 Updated Nov 1, 2024

ACM Multimedia 2023: DocDiff: Document Enhancement via Residual Diffusion Models. Also contains 1597 red seals in Chinese scenes, along with their corresponding binary masks.

Python 250 25 Updated Aug 22, 2024

A comprehensive list of awesome document image rectification papers.

379 29 Updated Apr 2, 2024

Efficient, scalable and enterprise-grade CPU/GPU inference server for 🤗 Hugging Face transformer models 🚀

Python 1,672 151 Updated Oct 23, 2024

Implementation of handwriting generation with use of recurrent neural networks in tensorflow. Based on Alex Graves paper (https://arxiv.org/abs/1308.0850).

Python 544 106 Updated Jan 14, 2018

A synthetic data generator for text recognition

Python 3,371 991 Updated Jul 18, 2024

This repository contains a paper collection of the methods for document image processing, including appearance enhancement, deshadow, dewarping, deblur, and binarization.

200 12 Updated Dec 30, 2024

This repository contains demos I made with the Transformers library by HuggingFace.

Jupyter Notebook 9,822 1,495 Updated Jan 13, 2025

Augmentation pipeline for rendering synthetic paper printing, faxing, scanning and copy machine processes

Python 372 45 Updated Sep 17, 2024

Generates CGI sample receipts for use in receipt scanning CV automated tests

Python 84 8 Updated Nov 22, 2022

Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022

Python 5,982 482 Updated Jul 11, 2024

OCR & Ground Truth Resources

74 11 Updated May 3, 2022

Code for AAAI 2023 Paper : “Alignment-Enriched Tuning for Patch-Level Pre-trained Document Image Models”

Python 17 3 Updated Dec 6, 2022

Pre-Training with Whole Word Masking for Chinese BERT(中文BERT-wwm系列模型)

Python 9,792 1,393 Updated Jul 31, 2023

The official code for “Deep Unrestricted Document Image Rectification”, TMM, 2023.

Python 413 46 Updated Nov 4, 2024

The official code for “DocTr: Document Image Transformer for Geometric Unwarping and Illumination Correction”, ACM MM, Oral Paper, 2021.

Python 365 50 Updated Jul 21, 2024

Synthesize distorted document image and control points.

Python 42 11 Updated Sep 14, 2022

This is a pytorch implementation of DocUNet: Document Image Unwarping via A Stacked U-Net

Jupyter Notebook 113 32 Updated Feb 15, 2019

Code for the paper "DewarpNet: Single-Image Document Unwarping With Stacked 3D and 2D Regression Networks" (ICCV '19)

Python 514 101 Updated Nov 10, 2024

Document Image Binarization

Python 75 16 Updated Oct 17, 2024

Save ranges from Excel documents as images

Python 102 26 Updated Dec 9, 2020

A selectional auto-encoder approach for document image binarization

Python 102 23 Updated Dec 8, 2022

Unofficial implementation of DocMAE (WIP): Document Image Rectification via Self-supervised Representation Learning

Python 15 3 Updated Dec 20, 2023

Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the official repository for the PubTables-1M dataset and GriTS ev…

Python 2,437 265 Updated Jun 24, 2024

DocEnTr: An end-to-end document image enhancement transformer - ICPR 2022

Jupyter Notebook 147 33 Updated Jan 17, 2025

Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in Pytorch

Python 11,206 1,089 Updated May 11, 2024
Next