Stars
Stable Diffusion web UI
Robust Speech Recognition via Large-Scale Weak Supervision
The most powerful and modular diffusion model GUI, API and backend with a graph/nodes interface.
Scrapy, a fast high-level web crawling & scraping framework for Python.
YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite
Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
A Gradio web UI for Large Language Models with support for multiple inference backends.
As little as 1 minute of voice data can be used to train a good TTS model! (few-shot voice cloning)
High-Resolution Image Synthesis with Latent Diffusion Models
A high-throughput and memory-efficient inference and serving engine for LLMs
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
A generative speech model for daily dialogue.
aider is AI pair programming in your terminal
Easily train a good VC model with voice data <= 10 mins!
SoftVC VITS Singing Voice Conversion
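A complete and graceful API for WeChat: a personal-account interface, a WeChat bot, and a command-line WeChat client. A custom personal-account bot takes only thirty lines of code.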
Universal LLM Deployment Engine with ML Compilation
Chinese LLaMA & Alpaca large language models, with local CPU/GPU training and deployment
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
ChatGLM3 series: Open Bilingual Chat LLMs
Freeze (package) Python programs into stand-alone executables
💮 Amazing QR code generator in Python (supports animated GIFs)
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.