Starred repositories
Hunt down social media accounts by username across social networks
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
Mask R-CNN for object detection and instance segmentation on Keras and TensorFlow
GUI for ChatGPT API and many LLMs. Supports agents, file-based QA, GPT finetuning and query with web search. All with a neat UI.
Python package built to ease deep learning on graph, on top of existing DL frameworks.
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translatio…
Geometric Computer Vision Library for Spatial AI
Python for《Deep Learning》,该书为《深度学习》(花书) 数学推导、原理剖析与源码级别代码实现
HunyuanVideo: A Systematic Framework For Large Video Generation Model
Pretrained Pytorch face detection (MTCNN) and facial recognition (InceptionResnet) models
🔥🔥High-Performance Face Recognition Library on PaddlePaddle & PyTorch🔥🔥
An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)
CnOCR: Awesome Chinese/English OCR Python toolkits based on PyTorch. It comes with 20+ well-trained models for different application scenarios and can be used directly after installation. 【基于 PyTor…
[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model
Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo
[python3.6] 运用tf实现自然场景文字检测,keras/pytorch实现ctpn+crnn+ctc实现不定长场景文字OCR识别
Show, Attend, and Tell | a PyTorch Tutorial to Image Captioning
Self-Supervised Speech Pre-training and Representation Learning Toolkit
VMamba: Visual State Space Models,code is based on mamba
YOLO5Face: Why Reinventing a Face Detector (https://arxiv.org/abs/2105.12931) ECCV Workshops 2022)