Stars
[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model
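A collection of AI companions, AI girlfriends, and AI boyfriends: a curated record of currently popular AI-generated images and videos, exploring the process of building AI virtual digital humans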
High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance
🧸 Lobe Vidol - Making Virtual Idols Accessible for Everyone
Ikaros-521 / AI-Vtuber
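Forked from sandboxdream/AI-Vtuber
AI Vtuber is a virtual streamer [Live2D/UE/xuniren] driven by [ChatterBot/ChatGPT/claude/langchain/chatglm/text-gen-webui/Wenda/Qwen/kimi/ollama] that can interact with viewers in real time during live streams on [Bilibili/Douyin/Kuaishou/WeChat Channels/Pinduoduo/Douyu/YouTube/twitch/TikTok], or chat directly on the local machine…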
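Fay is an agent framework that helps digital humans (2.5D, 3D, mobile, PC, web) or large language models (OpenAI-compatible, DeepSeek) connect to business systems.
VirtualWife is a virtual digital human project that supports Bilibili live streaming and works with OpenAI and Ollama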
AnimateDiff for AUTOMATIC1111 Stable Diffusion WebUI
How do we integrate AI generation tools into actual work? | Wiki about AI painting | Prompt Engineering | Guide | Seeking Maintainer & Translator 🙌
The most powerful and modular diffusion model GUI, API, and backend with a graph/nodes interface.
Automatically upload videos to social media: Douyin, Xiaohongshu, WeChat Channels, TikTok, YouTube, Bilibili
Foundational Models for State-of-the-Art Speech and Text Translation
Version 1.1 of the Alexander Koch low-cost robot arm, with some small changes.
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
API and WebSocket server for SenseVoice. It inherits some enhanced features, such as VAD, real-time streaming recognition, and speaker verification.
Training YOLOv5/YOLOv9 to detect fire in a video
fire-smoke-detect-yolov4-yolov5 and fire-smoke-detection-dataset: fire detection, smoke detection
Clone a voice in 5 seconds to generate arbitrary speech in real-time
FinGLM: dedicated to building an open, public-interest, long-lasting financial large-model project, using open source and openness to advance "AI + finance".
The online version is temporarily unavailable because we cannot afford the key. You can clone and run it locally. Note: we set a default OpenAI key. If the keys exceed their plan and become invalid, please tell us…
A modular graph-based Retrieval-Augmented Generation (RAG) system
GLM-4 series: Open Multilingual Multimodal Chat LMs
A generative speech model for daily dialogue.