Stars
Wan: Open and Advanced Large-Scale Video Generative Models
Command and Conquer: Generals - Zero Hour
Janus-Series: Unified Multimodal Understanding and Generation Models
High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models.
Tencent Hunyuan3D-1.0: A Unified Framework for Text-to-3D and Image-to-3D Generation
[ICLR'25] Official Implementation for Consistent Flow Distillation for Text-to-3D Generation
Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. C…
MeshGPT: Generating Triangle Meshes with Decoder-Only Transformers
SIGGRAPH 2024 Conference Paper: Deep Fourier-based Arbitrary-scale Super-resolution for Real-time Rendering
Implementing OCR with a local visual model run by ollama.
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Bring projects, wikis, and teams together with AI. AppFlowy is the AI collaborative workspace where you achieve more without losing control of your data. The leading open source Notion alternative.
OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340
ComfyUI-Manager is an extension designed to enhance the usability of ComfyUI. It offers management functions to install, remove, disable, and enable various custom nodes of ComfyUI. Furthermore, th…
Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, Du…
Developer-friendly, serverless vector database for AI applications. Easily add long-term memory to your LLM apps!
The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, No-code agent builder, and more.
[CVPR 2025] 3DTopia-XL: High-Quality 3D PBR Asset Generation via Primitive Diffusion
LlamaIndex is the leading framework for building LLM-powered agents over your data.
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.
OpenChat: Advancing Open-source Language Models with Imperfect Data
The slightly more awesome standard unix password manager for teams