Starred repositories
Stable Diffusion web UI
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Robust Speech Recognition via Large-Scale Weak Supervision
中英文敏感词、语言检测、中外手机/电话归属地/运营商查询、名字推断性别、手机号抽取、身份证抽取、邮箱抽取、中日文人名库、中文缩写库、拆字词典、词汇情感值、停用词、反动词表、暴恐词表、繁简体转换、英文模拟中文发音、汪峰歌词生成器、职业名称词库、同义词库、反义词库、否定词库、汽车品牌词库、汽车零件词库、连续英文切割、各种中文词向量、公司名字大全、古诗词库、IT词库、财经词库、成语词库、地名词库、…
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
Clone a voice in 5 seconds to generate arbitrary speech in real-time
real time face swap and one-click video deepfake with only a single image
ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型
High-Resolution Image Synthesis with Latent Diffusion Models
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Official Code for DragGAN (SIGGRAPH 2023)
🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time
A generative speech model for daily dialogue.
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Instant voice cloning by MIT and MyShell.
OpenMMLab Detection Toolbox and Benchmark
Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.
Real-time face swap for PC streaming or video calls
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
SoftVC VITS Singing Voice Conversion
Deezer source separation library including pretrained models.