Skip to content
View KellHuang's full-sized avatar

Block or report KellHuang

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

SkyReels V1: The first and most advanced open-source human-centric video foundation model

Python 1,229 91 Updated Feb 21, 2025

有趣的80后程序员的工作流分享

342 80 Updated Feb 22, 2025
Python 2,868 209 Updated Feb 21, 2025

🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming

Python 47,065 5,609 Updated Feb 19, 2025
Python 302 31 Updated Feb 21, 2025

Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expressiveness and quality on par with—or even surpassing—top TTS …

Python 5,291 515 Updated Feb 18, 2025

Video Generation Foundation Models: https://saiyan-world.github.io/goku/

Python 2,377 239 Updated Feb 19, 2025

An AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, web scraping, and large language models. The goal of this repo is to provide the si…

TypeScript 12,857 1,271 Updated Feb 17, 2025

Sonic is a method about ' Shifting Focus to Global Audio Perception in Portrait Animation',you can use it in comfyUI

Python 524 47 Updated Feb 17, 2025

Official implementation of "Sonic: Shifting Focus to Global Audio Perception in Portrait Animation"

Python 1,718 145 Updated Feb 10, 2025
Jupyter Notebook 2,245 300 Updated Feb 3, 2025

使用ai生成多章节的长篇小说,自动衔接上下文、伏笔

Python 707 146 Updated Feb 20, 2025

Janus-Series: Unified Multimodal Understanding and Generation Models

Python 16,149 2,124 Updated Feb 1, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 38,862 5,821 Updated Feb 22, 2025

High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models.

Python 6,379 478 Updated Feb 21, 2025

Motion-Controllable Video Diffusion via Warped Noise

Python 762 40 Updated Feb 17, 2025

🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.

TypeScript 26,835 2,204 Updated Feb 21, 2025

Riona 🌸 is built using Node.js and TypeScript 🛠️, designed for seamless job execution 📸. It's lightweight, efficient, and still evolving 🚧—exciting new features coming soon! 🌟

TypeScript 2,817 430 Updated Feb 21, 2025

Taming Stable Diffusion for Lip Sync!

Python 2,611 381 Updated Jan 19, 2025

Custom nodes for using MV-Adapter in ComfyUI.

Python 280 25 Updated Feb 13, 2025

AigcPanel 是一个简单易用的一站式AI数字人系统,支持视频合成、声音合成、声音克隆,简化本地模型管理、一键导入和使用AI模型。

TypeScript 1,325 196 Updated Feb 19, 2025

Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation".

Python 147 8 Updated Dec 28, 2024

Autonomous agents for everyone

TypeScript 14,610 4,598 Updated Feb 22, 2025

Official implementation of SVFR.

Python 728 69 Updated Jan 19, 2025

STAR: Spatial-Temporal Augmentation with Text-to-Video Models for Real-World Video Super-Resolution

Python 952 53 Updated Jan 22, 2025

Collection of awesome LLM apps with AI Agents and RAG using OpenAI, Anthropic, Gemini and opensource models.

Python 16,858 1,867 Updated Feb 16, 2025
Next