-
minivision
- NanJing, China
Stars
Models and examples built with TensorFlow
The world's simplest facial recognition api for Python and the command line
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
A generative world for general-purpose robotics & embodied AI learning.
A modular graph-based Retrieval-Augmented Generation (RAG) system
利用AI大模型,一键生成高清短视频 Generate short videos with one click using AI LLM.
中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)
Avatars for Zoom, Skype and other video-conferencing apps.
StyleGAN - Official TensorFlow Implementation
Image Polygonal Annotation with Python (polygon, rectangle, circle, line, point and image-level flag annotation).
🤖 Chat with your SQL database 📊. Accurate Text-to-SQL Generation via LLMs using RAG 🔄.
[CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
Keras implementations of Generative Adversarial Networks.
PyTorch3D is FAIR's library of reusable components for deep learning with 3D data
The code for our newly accepted paper in Pattern Recognition 2020: "U^2-Net: Going Deeper with Nested U-Structure for Salient Object Detection."
PaddlePaddle GAN library, including lots of interesting applications like First-Order motion transfer, Wav2Lip, picture repair, image editing, photo2cartoon, image style transfer, GPEN, and so on.
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Collection of generative models, e.g. GAN, VAE in Pytorch and Tensorflow.
[CVPR 2020] 3D Photography using Context-aware Layered Depth Inpainting
Real-Time High-Resolution Background Matting
Enjoy the magic of Diffusion models!