Stars
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
Tensors and Dynamic neural networks in Python with strong GPU acceleration
The Web framework for perfectionists with deadlines.
FastAPI framework, high performance, easy to learn, fast to code, ready for production
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra-lightweight OCR system, supports recognition of 80+ languages, provides data annotation and synthesis tools, supports training and…
LlamaIndex is a data framework for your LLM applications
Streamlit — A faster way to build and share data apps.
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.
The lean application framework for Python. Build sophisticated user interfaces with a simple Python API. Run your apps in the terminal and a web browser.
Ready-to-use OCR with 80+ supported languages and all popular writing scripts, including Latin, Chinese, Arabic, Devanagari, Cyrillic, etc.
Data validation using Python type hints
Convert PDF to markdown + JSON quickly with high accuracy
Tool for producing high-quality forecasts for time series data that has multiple seasonality with linear or non-linear growth.
PyScript is an open source platform for Python in the browser. Try PyScript: https://pyscript.com Examples: https://tinyurl.com/pyscript-examples Community: https://discord.gg/HxvBtukrg2
📷 Instagram Bot - Tool for automated Instagram interactions
State-of-the-Art Text Embeddings
OCR, layout analysis, reading order, table recognition in 90+ languages
SQL databases in Python, designed for simplicity, compatibility, and robustness.
👑 Easy-to-use and powerful NLP and LLM library with 🤗 Awesome model zoo, supporting a wide range of NLP tasks from research to industrial applications, including 🗂 Text Classification, 🔍 Neural Search…
Leveraging BERT and c-TF-IDF to create easily interpretable topics.
PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.
Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022
OpenMMLab Text Detection, Recognition and Understanding Toolbox