Stars
MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VLMs) on your Mac using MLX.
Qwen2-VL is the multimodal large language model series developed by the Qwen team, Alibaba Cloud.
TF-ID: Table/Figure IDentifier for academic papers
img2table is a table identification and extraction Python library for PDFs and images, based on OpenCV image processing
UniTable: Towards a Unified Table Foundation Model
Convert PDF to markdown + JSON quickly with high accuracy
OCR, layout analysis, reading order, table recognition in 90+ languages
PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.
RAG (Retrieval-Augmented Generation) Chatbot Examples Using PyMuPDF
Data processing with ML, LLM and Vision LLM
Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first approach
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
LlamaIndex is a data framework for your LLM applications
Get up and running with Llama 3.3, Mistral, Gemma 2, and other large language models.
Python bindings for the Transformer models implemented in C/C++ using the GGML library.
Zephyr 7B beta RAG Demo inside a Gradio app powered by BGE Embeddings, ChromaDB, and Zephyr 7B Beta LLM.
The better identity infrastructure for developers and the open-source alternative to Auth0.
Extracts emails and attachments saved in Microsoft Outlook's .msg files
An async Python micro framework for building web applications.
Get a ChatGPT plugin up and running in under 5 minutes!
React components to build charts and dashboards
Utilities to use the Hugging Face Hub API