mllife

mllife

1 follower · 13 following

Stars

iptv-org / iptv

Collection of publicly available IPTV channels from all over the world

JavaScript 88,427 2,918 Updated Jan 1, 2025

coqui-ai / TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 36,485 4,494 Updated Aug 16, 2024

suno-ai / bark

🔊 Text-Prompted Generative Audio Model

Jupyter Notebook 36,553 4,301 Updated Aug 19, 2024

huggingface / smol-course

A course on aligning smol models.

Jupyter Notebook 3,737 1,197 Updated Dec 30, 2024

NVIDIA-AI-Blueprints / multimodal-pdf-data-extraction

NVIDIA AI Blueprint for multimodal PDF data extraction for enterprise RAG

241 27 Updated Nov 19, 2024

DS4SD / deepsearch-glm

Create fast graph language models from converted PDF documents for knowledge extraction and Q&A.

C++ 31 7 Updated Dec 9, 2024

X-PLUG / mPLUG-DocOwl

mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding

Python 2,000 116 Updated Dec 24, 2024

CycloneBoy / pdf_table

A Unified Toolkit for Deep Learning-Based Table Extraction

Python 27 2 Updated Nov 21, 2024

arvindrajan92 / DTrOCR

A PyTorch implementation of DTrOCR: Decoder-only Transformer for Optical Character Recognition

Python 113 11 Updated Aug 18, 2024

JiaquanYe / TableMASTER-mmocr

2nd solution of ICDAR 2021 Competition on Scientific Literature Parsing, Task B.

Python 447 105 Updated Jul 4, 2022

DS4SD / docling

Get your documents ready for gen AI

Python 17,080 891 Updated Dec 19, 2024

InternLM / lmdeploy

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python 5,030 451 Updated Dec 31, 2024

DS4SD / docling-parse

Simple package to extract text with coordinates from programmatic PDFs

C++ 42 10 Updated Dec 16, 2024

UniModal4Reasoning / StructEqTable-Deploy

A High-efficiency Open-source Toolkit for Table-to-Latex Task

Python 182 13 Updated Dec 12, 2024

opendatalab / PDF-Extract-Kit

A Comprehensive Toolkit for High-Quality PDF Content Extraction

Python 6,231 407 Updated Dec 10, 2024

poloclub / unitable

UniTable: Towards a Unified Table Foundation Model

Jupyter Notebook 397 30 Updated Jun 4, 2024

Kanaries / pygwalker

PyGWalker: Turn your pandas dataframe into an interactive UI for visual analysis

Python 13,643 711 Updated Dec 30, 2024

michelcrypt4d4mus / pdfalyzer

Analyze PDFs. With colors. And Yara.

Python 250 19 Updated Dec 14, 2024

harrytea / Awesome-Document-Understanding

Document Artifical Intelligence

135 5 Updated Dec 8, 2024

DS4SD / docling-ibm-models

Python 55 10 Updated Dec 16, 2024

kermitt2 / grobid

A machine learning software for extracting information from scholarly documents

Java 3,685 459 Updated Jan 1, 2025

JaidedAI / EasyOCR

Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.

Python 25,022 3,200 Updated Sep 24, 2024

karpathy / LLM101n

LLM101n: Let's build a Storyteller

30,803 1,682 Updated Aug 1, 2024

VikParuchuri / marker

Convert PDF to markdown + JSON quickly with high accuracy

Python 18,931 1,107 Updated Jan 1, 2025

mindee / doctr

docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.

Python 4,092 458 Updated Dec 20, 2024

ajkdrag / ocrtoolkit

Experiment and integrate with different OCR frameworks seamlessly

Jupyter Notebook 104 4 Updated Apr 10, 2024

piegu / language-models

pre-trained Language Models

Jupyter Notebook 295 99 Updated Sep 5, 2024

mlabonne / llm-course

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Jupyter Notebook 40,523 4,306 Updated Jul 28, 2024

ashishps1 / awesome-low-level-design

Learn Low Level Design (LLD) and prepare for interviews using free resources.

Java 9,828 2,466 Updated Dec 30, 2024

ashishps1 / awesome-system-design-resources

Learn System Design concepts and prepare for interviews using free resources.

Java 19,325 4,648 Updated Dec 30, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly