Stars
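A relatively powerful web mind map.
This project implements Wav2lip video lip synthesis based on SadTalkers. It generates lip shapes driven by speech, taking a video file as input, and applies a configurable enhancement method to the synthesized lip (face) region to improve the clarity of the generated lips. It uses DAIN, a deep-learning frame-interpolation algorithm, to add frames to the generated video, filling in the motion transitions between synthesized lip frames so that the result looks smoother, more realistic, and more natural.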
A generative speech model for daily dialogue.
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
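Uses threejs and cannonjs to implement physics collision detection, joystick-controlled character movement, camera rotation, and photo capture with character-pose image compositing.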
Demos for xr-frame system in wx-mini-program.
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. A demo is available at https://plachtaa.github.io/vallex/
Mobile-Agent: The Powerful Mobile Device Operation Assistant Family
C0untFloyd / bark-gui
Forked from suno-ai/bark
🔊 Text-Prompted Generative Audio Model with Gradio
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
This repository contains the code for "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For an HD commercial model, please try out Sync Labs
Bark Voice Cloning and Voice Cloning for Chinese Speech
This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion
[CVPR 2023] SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
ChatGLM2-6B: An Open Bilingual Chat LLM
A C# implementation of the WebSocket protocol client and server
WeChat MiniProgram adapted version of Three.js
✨ Light and Fast AI Assistant. Support: Web | iOS | MacOS | Android | Linux | Windows
✨ A ChatGPT clone built with Vue3 + Vite + Tailwindcss! A web app with an identical experience! ✨
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.