Stars
An open-source tool-augmented conversational language model from Fudan University
wangzai23333 / blivedm
Forked from xfgryujk/blivedm获取bilibili直播弹幕,使用WebSocket协议
fay是一个帮助数字人(2.5d、3d、移动、pc、网页)或大语言模型(openai兼容、deepseek)连通业务系统的agent框架。
🔊 Text-Prompted Generative Audio Model
LangChain & LangGraph AI PDF chatbot agent
A programming framework for agentic AI 🤖 PyPi: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/autogen-officehour
the dataset and code for "Flow-guided One-shot Talking Face Generation with a High-resolution Audio-visual Dataset"
[ICCV'23] Efficient Region-Aware Neural Radiance Fields for High-Fidelity Talking Portrait Synthesis
The source code of "DINet: deformation inpainting network for realistic face visually dubbing on high resolution video."
Generate 3D objects conditioned on text or images
[CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs
GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.
Live Speech Portraits: Real-Time Photorealistic Talking-Head Animation (SIGGRAPH Asia 2021)
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
🤱🏻 Turn any webpage into a desktop app with Rust. 🤱🏻 利用 Rust 轻松构建轻量级多端桌面应用
[CVPR 2022] Thin-Plate Spline Motion Model for Image Animation.
Clone a voice in 5 seconds to generate arbitrary speech in real-time
An MBTI Exploration of Large Language Models
Fine-tuning ChatGLM-6B with PEFT | 基于 PEFT 的高效 ChatGLM 微调