Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expressiveness and quality on par with—or even surpassing—top TTS …

Python 5,291 515 Updated Feb 18, 2025

Saiyan-World / goku

Video Generation Foundation Models: https://saiyan-world.github.io/goku/

Python 2,377 239 Updated Feb 19, 2025

dzhng / deep-research

An AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, web scraping, and large language models. The goal of this repo is to provide the si…

TypeScript 12,857 1,271 Updated Feb 17, 2025

smthemex / ComfyUI_Sonic

Sonic is a method about ' Shifting Focus to Global Audio Perception in Portrait Animation',you can use it in comfyUI

Python 524 47 Updated Feb 17, 2025

jixiaozhong / Sonic

Official implementation of "Sonic: Shifting Focus to Global Audio Perception in Portrait Animation"

Python 1,718 145 Updated Feb 10, 2025

mshumer / OpenDeepResearcher

Jupyter Notebook 2,245 300 Updated Feb 3, 2025

YILING0013 / AI_NovelGenerator

使用ai生成多章节的长篇小说，自动衔接上下文、伏笔

Python 707 146 Updated Feb 20, 2025

deepseek-ai / DeepSeek-V3

Python 87,372 14,097 Updated Feb 18, 2025

deepseek-ai / Janus

Janus-Series: Unified Multimodal Understanding and Generation Models

Python 16,149 2,124 Updated Feb 1, 2025

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 38,862 5,821 Updated Feb 22, 2025

deepseek-ai / DeepSeek-R1

79,930 10,327 Updated Feb 18, 2025

Tencent / Hunyuan3D-2

High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models.

Python 6,379 478 Updated Feb 21, 2025

Eyeline-Research / Go-with-the-Flow

Motion-Controllable Video Diffusion via Warped Noise

Python 762 40 Updated Feb 17, 2025

mendableai / firecrawl

🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.

TypeScript 26,835 2,204 Updated Feb 21, 2025

David-patrick-chuks / Riona-AI-Agent

Riona 🌸 is built using Node.js and TypeScript 🛠️, designed for seamless job execution 📸. It's lightweight, efficient, and still evolving 🚧—exciting new features coming soon! 🌟

TypeScript 2,817 430 Updated Feb 21, 2025

bytedance / LatentSync

Taming Stable Diffusion for Lip Sync!

Python 2,611 381 Updated Jan 19, 2025

huanngzh / ComfyUI-MVAdapter

Custom nodes for using MV-Adapter in ComfyUI.

Python 280 25 Updated Feb 13, 2025

modstart-lib / aigcpanel

AigcPanel 是一个简单易用的一站式AI数字人系统，支持视频合成、声音合成、声音克隆，简化本地模型管理、一键导入和使用AI模型。

TypeScript 1,325 196 Updated Feb 19, 2025

sdbds / TRELLIS-for-windows

Forked from microsoft/TRELLIS

Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation".

Python 147 8 Updated Dec 28, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

KellHuang KellHuang

Block or report KellHuang

Lists (2)

Hallo

🚀 My stack

Stars

kijai / ComfyUI-HunyuanVideoWrapper

SkyworkAI / SkyReels-V1

amao2001 / ganloss-latent-space

stepfun-ai / Step-Audio

stepfun-ai / Step-Video-T2V

geekan / MetaGPT

sdbds / Zonos-for-windows

Zyphra / Zonos