Stars
[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model
AI伴侣/AI女友/AI男友合集,整合记录目前流行的AI生成图片和视频的合集,探索AI虚拟数字人过程
High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance
🧸 Lobe Vidol - Making Virtual Idols Accessible for EveryOne
Ikaros-521 / AI-Vtuber
Forked from sandboxdream/AI-VtuberAI Vtuber是一个由 【ChatterBot/ChatGPT/claude/langchain/chatglm/text-gen-webui/闻达/千问/kimi/ollama】 驱动的虚拟主播【Live2D/UE/xuniren】,可以在 【Bilibili/抖音/快手/微信视频号/拼多多/斗鱼/YouTube/twitch/TikTok】 直播中与观众实时互动 或 直接在本地进行聊…
Fay is an open-source digital human framework integrating language models and digital characters. It offers retail, assistant, and agent versions for diverse applications like virtual shopping guid…
VirtualWife是一个虚拟数字人项目,支持B站直播,支持openai、ollama
AnimateDiff for AUTOMATIC1111 Stable Diffusion WebUI
How do we integrate AI generation tools into actual work? | 关于 Ai 绘画的Wiki | Wiki about Ai painting | Prompts Engineering| 指南 Guide | Seeking Maintainer&Translator🙌
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
自动化上传视频到社交媒体:抖音、小红书、视频号、tiktok、youtube、bilibili
Foundational Models for State-of-the-Art Speech and Text Translation
A version 1.1 of the Alexander Koch low cost robot arm with some small changes.
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
API and websocket server for sensevoice. It has inherited some enhanced features, such as VAD detection, real-time streaming recognition, and speaker verification.
Training YOLOv5/YOLOv9 to detect fire in a video
fire-smoke-detect-yolov4-yolov5 and fire-smoke-detection-dataset 火灾检测,烟雾检测
Clone a voice in 5 seconds to generate arbitrary speech in real-time
FinGLM: 致力于构建一个开放的、公益的、持久的金融大模型项目,利用开源开放来促进「AI+金融」。
The online version is temporarily unavailable because we cannot afford the key. You can clone and run it locally. Note: we set defaul openai key. If keys exceed plan and are invalid, please tell us…
A modular graph-based Retrieval-Augmented Generation (RAG) system
GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型
A generative speech model for daily dialogue.