Starred repositories
mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding
EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation
SGLang is a fast serving framework for large language models and vision language models.
Efficiently find the best-suited language model (LM) for your NLP task
Virtual whiteboard for sketching diagrams with a hand-drawn look
DSPy: The framework for programming—not prompting—language models
NVIDIA Ingest is an early access set of microservices for parsing hundreds of thousands of complex, messy unstructured PDFs and other enterprise documents into metadata and text to embed into retri…
Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
PlayStation 4 emulator for Windows, Linux and macOS written in C++
[NeurIPS 24] PromptFix: You Prompt and We Fix the Photo
Generate accurate transcripts using Apple's MLX framework
A powerful Python tool that leverages Claude 3.5 Sonnet Vision API to detect and visualize objects in images. The script automatically draws bounding boxes around detected objects, labels them, and…
Convert PDF to markdown + JSON quickly with high accuracy
Document (PDF) extraction and parsing API using state-of-the-art OCRs and Ollama-supported models. Anonymize documents. Remove PII. Convert any document or picture to structured JSON or Markdown
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…
Depth Pro: Sharp Monocular Metric Depth in Less Than a Second.
1000+ DevOps Bash Scripts - AWS, GCP, Kubernetes, Docker, CI/CD, APIs, SQL, PostgreSQL, MySQL, Hive, Impala, Kafka, Hadoop, Jenkins, GitHub, GitLab, BitBucket, Azure DevOps, TeamCity, Spotify, MP3,…
A blazing fast AI Gateway with integrated guardrails. Route to 200+ LLMs, 50+ AI Guardrails with 1 fast & friendly API.
⚡️HivisionIDPhotos: a lightweight and efficient AI tool and algorithm for making ID photos.
Inpaint Anything performs stable diffusion inpainting on a browser UI using masks from Segment Anything.