Lists (2)
Sort Name ascending (A-Z)
Stars
SkyReels V1: The first and most advanced open-source human-centric video foundation model
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expressiveness and quality on par with—or even surpassing—top TTS …
Video Generation Foundation Models: https://saiyan-world.github.io/goku/
An AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, web scraping, and large language models. The goal of this repo is to provide the si…
Sonic is a method about ' Shifting Focus to Global Audio Perception in Portrait Animation',you can use it in comfyUI
Official implementation of "Sonic: Shifting Focus to Global Audio Perception in Portrait Animation"
Janus-Series: Unified Multimodal Understanding and Generation Models
A high-throughput and memory-efficient inference and serving engine for LLMs
High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models.
Motion-Controllable Video Diffusion via Warped Noise
🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.
Riona 🌸 is built using Node.js and TypeScript 🛠️, designed for seamless job execution 📸. It's lightweight, efficient, and still evolving 🚧—exciting new features coming soon! 🌟
Custom nodes for using MV-Adapter in ComfyUI.
AigcPanel 是一个简单易用的一站式AI数字人系统,支持视频合成、声音合成、声音克隆,简化本地模型管理、一键导入和使用AI模型。
sdbds / TRELLIS-for-windows
Forked from microsoft/TRELLISOfficial repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation".
STAR: Spatial-Temporal Augmentation with Text-to-Video Models for Real-World Video Super-Resolution
Collection of awesome LLM apps with AI Agents and RAG using OpenAI, Anthropic, Gemini and opensource models.