Starred repositories
Robust Speech Recognition via Large-Scale Weak Supervision
The world's simplest facial recognition API for Python and the command line
openpilot is an operating system for robotics. Currently, it upgrades the driver assistance system on 275+ supported cars.
Awesome multilingual OCR toolkits based on PaddlePaddle (a practical, ultra-lightweight OCR system that supports recognition of 80+ languages, provides data annotation and synthesis tools, supports training and…
ChatGLM-6B: An Open Bilingual Dialogue Language Model
TensorFlow code and pre-trained models for BERT
🚀 AI voice cloning: clone a voice in 5 seconds to generate arbitrary speech in real time
A generative speech model for daily dialogue.
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
FAIR's research platform for object detection research, implementing popular algorithms like Mask R-CNN and RetinaNet.
State-of-the-art 2D and 3D Face Analysis Project
A Deep Learning based project for colorizing and restoring old images (and video!)
PDF scientific paper translation with preserved formats: AI-based full-text bilingual translation of PDF documents with the layout fully preserved, supporting services such as Google/DeepL/Ollama/OpenAI and providing CLI/GUI/Docker/Zotero
Easy-to-use image segmentation library with awesome pre-trained model zoo, supporting a wide range of practical tasks in Semantic Segmentation, Interactive Segmentation, Panoptic Segmentation, Image …
A Python library built to empower developers to build applications and systems with self-contained Computer Vision capabilities
GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)
PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.
A simple local web interface that uses ChatTTS to synthesize text into speech, with support for exposing an external API.
Official TensorFlow implementation of U-GAT-IT: Unsupervised Generative Attentional Networks with Adaptive Layer-Instance Normalization for Image-to-Image Translation (ICLR 2020)
🔥🔥🔥AidLearning is a powerful AIoT development platform. AidLearning builds a Linux environment supporting GUI, deep learning and a visual IDE on Android. Aid now supports CPU+GPU+NPU for inference with high…
All-in-One Development Tool based on PaddlePaddle (PaddlePaddle low-code development tool)
A command line toolkit to generate maps, point clouds, 3D models and DEMs from drone, balloon or kite images. 📷
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
Official implementation of DeepLabCut: Markerless pose estimation of user-defined features with deep learning for all animals incl. humans
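text2vec, text to vector. A text vector representation tool that converts text into vector matrices, implementing text representation and text similarity models such as Word2Vec, RankBM25, Sentence-BERT, and CoSENT, ready to use out of the box.
General-purpose extractor for the main body text of news web pages, Beta version.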
🔥🔥High-Performance Face Recognition Library on PaddlePaddle & PyTorch🔥🔥