Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and…

Python 46,053 7,968 Updated Jan 29, 2025

Stability-AI / stablediffusion

High-Resolution Image Synthesis with Latent Diffusion Models

Python 39,908 5,123 Updated Oct 10, 2024

facebookresearch / detectron2

Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.

Python 31,111 7,571 Updated Jan 14, 2025

open-mmlab / mmdetection

OpenMMLab Detection Toolbox and Benchmark

Python 30,127 9,526 Updated Aug 21, 2024

unclecode / crawl4ai

🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper

Python 28,111 2,222 Updated Jan 31, 2025

JaidedAI / EasyOCR

Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.

Python 25,356 3,232 Updated Sep 24, 2024

roboflow / supervision

We write your reusable computer vision tools. 💜

Python 24,747 1,843 Updated Jan 27, 2025

Genesis-Embodied-AI / Genesis

A generative world for general-purpose robotics & embodied AI learning.

Python 23,443 1,975 Updated Jan 30, 2025

HumanSignal / labelImg

LabelImg is now part of the Label Studio community. The popular image annotation tool created by Tzutalin is no longer actively being developed, but you can check out Label Studio, the open source …

Python 23,171 6,369 Updated Jun 7, 2024

PaddlePaddle / PaddleDetection

Object Detection toolkit based on PaddlePaddle. It supports object detection, instance segmentation, multiple object tracking and real-time multi-person keypoint detection.

Python 13,040 2,913 Updated Jan 16, 2025

THU-MIG / yolov10

YOLOv10: Real-Time End-to-End Object Detection [NeurIPS 2024]

Python 10,318 1,031 Updated Sep 26, 2024

gruns / icecream

🍦 Never use print() to debug again.

Python 9,468 197 Updated Jan 13, 2025

voxel51 / fiftyone

Refine high-quality datasets and visual AI models

Python 9,122 591 Updated Jan 31, 2025

open-mmlab / mmsegmentation

OpenMMLab Semantic Segmentation Toolbox and Benchmark.

Python 8,505 2,647 Updated Aug 13, 2024

IDEA-Research / GroundingDINO

[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Python 7,275 735 Updated Aug 12, 2024

yangchris11 / samurai

Official repository of "SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory"

Python 6,416 399 Updated Jan 29, 2025

naver / dust3r

DUSt3R: Geometric 3D Vision Made Easy

Python 5,732 621 Updated Sep 20, 2024

Baekalfen / PyBoy

Game Boy emulator written in Python

Python 4,693 488 Updated Jan 20, 2025

isl-org / MiDaS

Code for robust monocular depth estimation described in "Ranftl et. al., Towards Robust Monocular Depth Estimation: Mixing Datasets for Zero-shot Cross-dataset Transfer, TPAMI 2022"

Python 4,655 650 Updated Aug 23, 2024