
Lists (6)
Sort Name ascending (A-Z)
Starred repositories
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and…
Python packaging and dependency management made easy
Certbot is EFF's tool to obtain certs from Let's Encrypt and (optionally) auto-enable HTTPS on your server. It can also act as a client for any other CA that uses the ACME protocol.
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
⚡ A Fast, Extensible Progress Bar for Python and CLI
Write scalable load tests in plain Python 🚗💨
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
A curated list of awesome Amazon Web Services (AWS) libraries, open source repos, guides, blogs, and other resources. Featuring the Fiery Meter of AWSome.
An open source multi-tool for exploring and publishing data
Quickly rewrite git repository history (filter-branch replacement)
Web mining module for Python, with tools for scraping, natural language processing, machine learning, network analysis and visualization.
A new kind of Progress Bar, with real-time throughput, ETA, and very cool animations!
A Unified Toolkit for Deep Learning Based Document Image Analysis
A Python library to extract tabular data from PDFs
An API Client package to access the APIs for NBA.com
markdown2: A fast and complete implementation of Markdown in Python
Repository containing evidence of police brutality during the 2020 George Floyd protests
Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the official repository for the PubTables-1M dataset and GriTS ev…
Simple wrapper of tabula-java: extract table from PDF into pandas DataFrame
Basic Utilities for PyTorch Natural Language Processing (NLP)
Members of the United States Congress, 1789-Present, in YAML/JSON/CSV, as well as committees, presidents, and vice presidents.
A Pure Python, React-style Framework for Scaling Your Jupyter and Web Apps
Python library to build pretty command line user prompts ✨Easy to use multi-select lists, confirmations, free text prompts ...
Public domain data collectors for the work of Congress, including legislation, amendments, and votes.
skweak: A software toolkit for weak supervision applied to NLP tasks
Data management framework for Python that provides functionality to describe, extract, validate, and transform tabular data