Starred repositories
OpenOCR: A general OCR system with accuracy and efficiency. Supporting 24 Scene Text Recognition methods trained from scratch on large-scale real datasets, and will continue to add the latest methods.
A simple GUI designer for the python tkinter module
Python tool for grabbing text via screenshot
Open-source foundation of the user-sponsored PyMOL molecular visualization system.
Python tool for converting files and office documents to Markdown.
🎨 ML Visuals contains figures and templates which you can reuse and customize to improve your scientific writing.
JohannesBuchner / imagehash
Forked from bunchesofdonald/photohashA Python Perceptual Image Hashing Module
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…
[CVPR 2024] Official RT-DETR (RTDETR paddle pytorch), Real-Time DEtection TRansformer, DETRs Beat YOLOs on Real-time Object Detection. 🔥 🔥 🔥
D-FINE: Redefine Regression Task of DETRs as Fine-grained Distribution Refinement 💥💥💥
Open-source platform for extracting structured data from documents using AI.
Continuation of an abandoned project fast-coco-eval
整理目前开源的最优表格识别模型,完善前后处理,模型转换为ONNX Organize the currently open-source optimal table recognition models, improve pre-processing and post-processing, and convert the models to ONNX.
The LLM's practical guide: From the fundamentals to deploying advanced LLM and RAG apps to AWS using LLMOps best practices
Simple project page template for your research paper, built with Astro and Tailwind CSS
DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception
Official data repository for the Open Reaction Database
Simple Implementation of Pix2Seq model for object detection in PyTorch
360LayoutAnaylsis, a series Document Analysis Models and Datasets deleveped by 360 AI Research Institute
很多镜像都在国外。比如 gcr 。国内下载很慢,需要加速。致力于提供连接全世界的稳定可靠安全的容器镜像服务。
Enhance Tesseract OCR output for scanned PDFs by applying Large Language Model (LLM) corrections.
A Unified Toolkit for Deep Learning-Based Table Extraction
A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。
A Comprehensive Toolkit for High-Quality PDF Content Extraction
Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization
SciMind: A Multimodal Mixture-of-Experts Model for Advancing Pharmaceutical Sciences
A small, highly performant JavaScript component for parsing and drawing SMILES strings. Released under the MIT license.
Open Parser for Systematic IUPAC Nomenclature. Chemical name to structure conversion