Stars
🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.
A python program that turns an LLM, running on Ollama, into an automated researcher, which will with a single query determine focus areas to investigate, do websearches and scrape content from vari…
Fractal Graph-of-Thought. Rhizomatic Mind-Mapping for Ai-Agents, Web-Links, Notes, and Code.
Neo4j graph construction from unstructured data using LLMs
Build high-quality LLM apps - from prototyping, testing to production deployment and monitoring.
[EMNLP'23, ACL'24] To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which achieves up to 20x compression with minimal performance loss.
Superduper: Build end-to-end AI applications and agent workflows on your existing data infrastructure and preferred tools - without migrating your data.
A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。
A fast minimalistic implementation of guided generation on Apple Silicon using Outlines and MLX
A Python implementation of global optimization with gaussian processes.
Python based GBDT implementation on GPU. Efficient multioutput (multiclass/multilabel/multitask) training
An object-oriented algebraic modeling language in Python for structured optimization problems.
A plugin for reading and annotating PDFs and EPUBs in obsidian.
Chat with your notes & see links to related content with AI embeddings. Use local models or 100+ via APIs like Claude, Gemini, ChatGPT & Llama 3
Discrete Optimization is a python library to ease the definition and re-use of discrete optimization problems and solvers.
Enforce the output format (JSON Schema, Regex etc) of a language model
Sync your ML data with your favorite productivity tools!
A generative AI extension for JupyterLab
DSPy: The framework for programming—not prompting—language models
pykoi: Active learning in one unified interface
A library of data loaders for LLMs made by the community -- to be used with LlamaIndex and/or LangChain
Fast and memory-efficient exact attention
Benchmarking framework for protein representation learning. Includes a large number of pre-training and downstream task datasets, models and training/task utilities. (ICLR 2024)
CLI platform to experiment with codegen. Precursor to: https://lovable.dev
Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud.