Stars
Collection of publicly available IPTV channels from all over the world
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
🔊 Text-Prompted Generative Audio Model
A course on aligning smol models.
NVIDIA AI Blueprint for multimodal PDF data extraction for enterprise RAG
Create fast graph language models from converted PDF documents for knowledge extraction and Q&A.
mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding
A Unified Toolkit for Deep Learning-Based Table Extraction
A PyTorch implementation of DTrOCR: Decoder-only Transformer for Optical Character Recognition
2nd solution of ICDAR 2021 Competition on Scientific Literature Parsing, Task B.
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
Simple package to extract text with coordinates from programmatic PDFs
A High-efficiency Open-source Toolkit for Table-to-Latex Task
A Comprehensive Toolkit for High-Quality PDF Content Extraction
UniTable: Towards a Unified Table Foundation Model
PyGWalker: Turn your pandas dataframe into an interactive UI for visual analysis
Analyze PDFs. With colors. And Yara.
A machine learning software for extracting information from scholarly documents
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
Convert PDF to markdown + JSON quickly with high accuracy
docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.
Experiment and integrate with different OCR frameworks seamlessly
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
Learn Low Level Design (LLD) and prepare for interviews using free resources.
Learn System Design concepts and prepare for interviews using free resources.