-
DeepCam
- Shenzhen, China
- www.deepcam.cn
Stars
Robust Speech Recognition via Large-Scale Weak Supervision
中英文敏感词、语言检测、中外手机/电话归属地/运营商查询、名字推断性别、手机号抽取、身份证抽取、邮箱抽取、中日文人名库、中文缩写库、拆字词典、词汇情感值、停用词、反动词表、暴恐词表、繁简体转换、英文模拟中文发音、汪峰歌词生成器、职业名称词库、同义词库、反义词库、否定词库、汽车品牌词库、汽车零件词库、连续英文切割、各种中文词向量、公司名字大全、古诗词库、IT词库、财经词库、成语词库、地名词库、…
《动手学深度学习》:面向中文读者、能运行、可讨论。中英文版被70多个国家的500多所大学用于教学。
A natural language interface for computers
Clone a voice in 5 seconds to generate arbitrary speech in real-time
real time face swap and one-click video deepfake with only a single image
A Gradio web UI for Large Language Models with support for multiple inference backends.
ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型
High-Resolution Image Synthesis with Latent Diffusion Models
Langflow is a low-code app builder for RAG and multi-agent AI applications. It’s Python-based and agnostic to any model, API, or database.
LlamaIndex is a data framework for your LLM applications
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
Opiniated RAG for integrating GenAI in your apps 🧠 Focus on your product rather than the RAG. Easy integration in existing products with customisation! Any LLM: GPT4, Groq, Llama. Any Vectorstore: …
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.
A programming framework for agentic AI 🤖 (PyPi: autogen-agentchat)
A generative speech model for daily dialogue.
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
A high-throughput and memory-efficient inference and serving engine for LLMs
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
Instant voice cloning by MIT and MyShell.
OpenMMLab Detection Toolbox and Benchmark
Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.
Real-time face swap for PC streaming or video calls
An open-source PAM tool alternative to CyberArk. 广受欢迎的开源堡垒机。
State-of-the-art 2D and 3D Face Analysis Project