OpenOCR: A general OCR system built for accuracy and efficiency. It supports 24 scene text recognition methods trained from scratch on large-scale real datasets, and the latest methods will continue to be added.

Python 495 39 Updated Feb 23, 2025

wnlen / clash-for-linux

clash-for-linux

Shell 2,068 704 Updated Dec 12, 2023

jmhessel / clipscore

CLIPScore EMNLP code

Python 211 27 Updated Dec 16, 2022

Stability-AI / sd3.5

Python 1,007 74 Updated Jan 8, 2025

jsh-me / psnr-ssim-tool

Peak signal-to-noise ratio (PSNR) and structural similarity (SSIM) calculation tool

Python 20 3 Updated Jun 26, 2020

XLabs-AI / x-flux

Python 1,903 135 Updated Nov 8, 2024

black-forest-labs / flux

Official inference repo for FLUX.1 models

Python 20,445 1,435 Updated Feb 6, 2025

lucidrains / flamingo-pytorch

Implementation of 🦩 Flamingo, the state-of-the-art few-shot visual question answering attention net from DeepMind, in PyTorch

Python 1,231 59 Updated Oct 18, 2022

bghira / SimpleTuner

A general fine-tuning kit geared toward diffusion models.

Python 2,107 199 Updated Feb 22, 2025

amazon-science / glass-text-spotting

Official implementation for "GLASS: Global to Local Attention for Scene-Text Spotting" (ECCV'22)

Python 102 12 Updated Jun 28, 2024

lllyasviel / Omost

Your image is almost there!

Python 7,504 427 Updated Jul 26, 2024

Alpha-VLLM / Lumina-T2X

Lumina-T2X is a unified framework for Text to Any Modality Generation

Python 2,159 91 Updated Feb 16, 2025

THUDM / Inf-DiT

Official implementation of Inf-DiT: Upsampling Any-Resolution Image with Memory-Efficient Diffusion Transformer

Python 411 20 Updated Jul 5, 2024

LC-John / Fashion-MNIST

TeX 29 15 Updated Mar 11, 2022

lcy0604 / CTRNet

This repository is the implementation of "Don't Forget Me: Accurate Background Recovery for Text Removal via Modeling Local-Global Context".

Python 85 8 Updated Feb 21, 2023

abyildirim / inst-inpaint

A novel inpainting framework that can remove objects from images based on the instructions given as text prompts.

Python 369 26 Updated Aug 13, 2023

advimman / lama

🦙 LaMa Image Inpainting, Resolution-robust Large Mask Inpainting with Fourier Convolutions, WACV 2022

Jupyter Notebook 8,441 898 Updated Feb 5, 2025

PKU-ICST-MIPL / PosterLayout-CVPR2023

Official repository for "PosterLayout: A New Benchmark and Approach for Content-aware Visual-Textual Presentation Layout" (CVPR 2023).

Python 129 6 Updated Jul 11, 2024

ZYM-PKU / UDiffText

UDiffText: A Unified Framework for High-quality Text Synthesis in Arbitrary Images via Character-aware Diffusion Models

Python 217 17 Updated Feb 14, 2025

FoundationVision / VAR

[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ult…

Jupyter Notebook 6,712 440 Updated Jan 12, 2025

jc19chaoj / simpleANN

A simple ANN implemented in NumPy

Python 1 Updated Feb 11, 2021

whlzy / FiT

[ICML 2024 Spotlight] FiT: Flexible Vision Transformer for Diffusion Model

Python 404 11 Updated Nov 10, 2024

bytedance / E2STR

The official code for the CVPR 2024 paper: Multi-modal In-Context Learning Makes an Ego-evolving Scene Text Recognizer

Python 49 4 Updated Jun 14, 2024

ZYM ZYM-PKU
