sofiawu

sofiawu

5 followers · 13 following

Beijing

Stars

harrytea / Awesome-Document-Understanding

Document Artifical Intelligence

139 6 Updated Dec 8, 2024

Topdu / OpenOCR

OpenOCR: A general OCR system with accuracy and efficiency. Supporting 24 Scene Text Recognition methods trained from scratch on large-scale real datasets, and will continue to add the latest methods.

Python 448 37 Updated Jan 3, 2025

wdndev / llm_interview_note

主要记录大语言大模型（LLMs）算法（应用）工程师相关的知识及面试题

HTML 4,837 558 Updated Oct 22, 2024

chaofengc / Awesome-Image-Quality-Assessment

A comprehensive collection of IQA papers

TeX 1,084 73 Updated Jan 5, 2025

adlnlp / form_nlu

12 1 Updated Nov 1, 2024

Royalvice / DocDiff

ACM Multimedia 2023: DocDiff: Document Enhancement via Residual Diffusion Models. Also contains 1597 red seals in Chinese scenes, along with their corresponding binary masks.

Python 250 25 Updated Aug 22, 2024

fh2019ustc / Awesome-Document-Image-Rectification

A comprehensive list of awesome document image rectification papers.

379 29 Updated Apr 2, 2024

ELS-RD / transformer-deploy

Efficient, scalable and enterprise-grade CPU/GPU inference server for 🤗 Hugging Face transformer models 🚀

Python 1,672 151 Updated Oct 23, 2024

Grzego / handwriting-generation

Implementation of handwriting generation with use of recurrent neural networks in tensorflow. Based on Alex Graves paper (https://arxiv.org/abs/1308.0850).

Python 544 106 Updated Jan 14, 2018

Belval / TextRecognitionDataGenerator

A synthetic data generator for text recognition

Python 3,371 991 Updated Jul 18, 2024

ZZZHANG-jx / Recommendations-Document-Image-Processing

This repository contains a paper collection of the methods for document image processing, including appearance enhancement, deshadow, dewarping, deblur, and binarization.

200 12 Updated Dec 30, 2024

NielsRogge / Transformers-Tutorials

This repository contains demos I made with the Transformers library by HuggingFace.

Jupyter Notebook 9,822 1,495 Updated Jan 13, 2025

sparkfish / augraphy

Augmentation pipeline for rendering synthetic paper printing, faxing, scanning and copy machine processes

Python 372 45 Updated Sep 17, 2024

amoffat / metabrite-receipt-tests

Generates CGI sample receipts for use in receipt scanning CV automated tests

Python 84 8 Updated Nov 22, 2022

clovaai / donut

Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022

Python 5,982 482 Updated Jul 11, 2024

cneud / ocr-gt

OCR & Ground Truth Resources

74 11 Updated May 3, 2022

MAEHCM / AET

Code for AAAI 2023 Paper : “Alignment-Enriched Tuning for Patch-Level Pre-trained Document Image Models”

Python 17 3 Updated Dec 6, 2022

ymcui / Chinese-BERT-wwm

Pre-Training with Whole Word Masking for Chinese BERT（中文BERT-wwm系列模型）

Python 9,792 1,393 Updated Jul 31, 2023

fh2019ustc / DocTr-Plus

The official code for “Deep Unrestricted Document Image Rectification”, TMM, 2023.

Python 413 46 Updated Nov 4, 2024

fh2019ustc / DocTr

The official code for “DocTr: Document Image Transformer for Geometric Unwarping and Illumination Correction”, ACM MM, Oral Paper, 2021.

Python 365 50 Updated Jul 21, 2024

gwxie / Synthesize-Distorted-Image-and-Its-Control-Points

Synthesize distorted document image and control points.

Python 42 11 Updated Sep 14, 2022

teresasun / docUnet.pytorch

This is a pytorch implementation of DocUNet: Document Image Unwarping via A Stacked U-Net

Jupyter Notebook 113 32 Updated Feb 15, 2019

cvlab-stonybrook / DewarpNet

Code for the paper "DewarpNet: Single-Image Document Unwarping With Stacked 3D and 2D Regression Networks" (ICCV '19)

Python 514 101 Updated Nov 10, 2024

qurator-spk / sbb_binarization

Document Image Binarization

Python 75 16 Updated Oct 17, 2024

glexey / excel2img

Save ranges from Excel documents as images

Python 102 26 Updated Dec 9, 2020

ajgallego / document-image-binarization

A selectional auto-encoder approach for document image binarization

Python 102 23 Updated Dec 8, 2022

Dawars / DocMAE

Unofficial implementation of DocMAE (WIP): Document Image Rectification via Self-supervised Representation Learning

Python 15 3 Updated Dec 20, 2023

microsoft / table-transformer

Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the official repository for the PubTables-1M dataset and GriTS ev…

Python 2,437 265 Updated Jun 24, 2024

dali92002 / DocEnTR

DocEnTr: An end-to-end document image enhancement transformer - ICPR 2022

Jupyter Notebook 147 33 Updated Jan 17, 2025

lucidrains / DALLE2-pytorch

Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in Pytorch

Python 11,206 1,089 Updated May 11, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly