
Lists (10)
Sort Name ascending (A-Z)
Starred repositories
Explore the Multimodal “Aha Moment” on 2B Model
A comprehensive video analysis tool that combines computer vision, audio transcription, and natural language processing to generate detailed descriptions of video content. This tool extracts key fr…
Public facing notes page
🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!
100 % FREE, Private (No Internet) DeepSeek’s Advanced RAG: Boost Your RAG Chatbot: Hybrid Retrieval (BM25 + FAISS) + Neural Reranking + HyDe🚀
A high-throughput and memory-efficient inference and serving engine for LLMs
Research repository on interfacing LLMs with Weaviate APIs. Inspired by the Berkeley Gorilla LLM.
This repository shares end-to-end notebooks on how to use various Weaviate features and integrations!
Make websites accessible for AI agents
Sample apps to help developers get started with Structured Outputs
A curated list of resources about AI agents for Computer Use, including research papers, projects, frameworks, and tools.
⚙️ Convert HTML to Markdown. Even works with entire websites and can be extended through rules.
A python module to repair invalid JSON from LLMs
Curated list of datasets and tools for post-training.
A simple screen parsing tool towards pure vision based GUI agent
User-friendly Desktop Client App for AI Models/LLMs (GPT, Claude, Gemini, Ollama...)
Development repository for the Triton language and compiler
A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations
Parse PDFs into markdown using Vision LLMs
OCR & Document Extraction using vision models
A Comprehensive Toolkit for High-Quality PDF Content Extraction
Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks
LLM based autonomous agent that conducts deep local and web research on any topic and generates a long report with citations.
Open source alternative to Gemini Deep Research. Generate reports with AI based on search results.
Fully local web research and report writing assistant
An open source deep research clone. AI Agent that reasons large amounts of web data extracted with Firecrawl
Keep searching, reading webpages, reasoning until it finds the answer (or exceeding the token budget)
A simple open-source chat app that uses Exa's API for web search and Deepseek R1 for reasoning