Stars
O1 Replication Journey: A Strategic Progress Report – Part I
File parser optimised for lossless LLM ingestion 🧠 Parses PDF, DOCX, and PPTX files into a format that is ideal for LLMs.
Everything about the SmolLM & SmolLM2 family of models
Easily deployable and scalable backend server that efficiently converts various document formats (pdf, docx, pptx, html, images, etc) into Markdown. With support for both CPU and GPU processing, it…
EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation
State-of-the-Art Text Embeddings
A high-quality tool for converting PDF to Markdown and JSON. A one-stop, open-source, high-quality data extraction tool that converts PDFs to Markdown and JSON.
Official documentation for getting things done with Nix.
Undetectable, Lightning-Fast, and Adaptive Web Scraping for Python
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
High-resolution models for human tasks.
An open-source RAG-based tool for chatting with your documents.
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
Real-time face swap and one-click video deepfake with only a single image
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal dialogue model approaching GPT-4o performance.
[CVPR'24 Highlight & Best Demo Award] Gaussian Splatting SLAM
Running large language models on a single GPU for throughput-oriented scenarios.
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
A Bulletproof Way to Generate Structured JSON from Language Models
A framework for serving and evaluating LLM routers - save LLM costs without compromising quality!
This repo contains notebooks for the Advanced Solutions Lab: ML Immersion
[CVPR'24 Highlight] Official PyTorch implementation of CoDeF: Content Deformation Fields for Temporally Consistent Video Processing
Open-Sora: Democratizing Efficient Video Production for All
Huly — All-in-One Project Management Platform (alternative to Linear, Jira, Slack, Notion, Motion)