Stars
Multi-Tasks (Semantic Segmentation + Depth Estimation) with Real-Time Light-Weight RefineNet
Multi-Task (Joint Segmentation / Depth / Surface Normas) Real-Time Light-Weight RefineNet
Competitive Collaboration: Joint Unsupervised Learning of Depth, Camera Motion, Optical Flow and Motion Segmentation
Unsupervised single image depth prediction with CNNs
Unofficial implementation of Unsupervised Monocular Depth Estimation neural network MonoDepth in PyTorch
Pytorch version of SfmLearner from Tinghui Zhou et al.
Light-Weight RefineNet for Real-Time Semantic Segmentation
当下热门的模糊人脸修复模型的部署,分别是:Codeformer,GFPGAN,GPEN,Restoreformer
DifFace: Blind Face Restoration with Diffused Error Contraction (TPAMI, 2024)
[NeurIPS 2022] Towards Robust Blind Face Restoration with Codebook Lookup Transformer
GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.
⛹️ Pytorch ReID: A tiny, friendly, strong pytorch implement of person re-id / vehicle re-id baseline. Tutorial 👉https://github.com/layumi/Person_reID_baseline_pytorch/tree/master/tutorial
The Pytorch implementation of sound classification supports EcapaTdnn, PANNS, TDNN, Res2Net, ResNetSE and other models, as well as a variety of preprocessing methods.
城市声音分类 Urban Sound Classification with TensorFlow Keras - MLP, RNN, CNN
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
A non-native English corpus for pronunciation scoring task
Script for training a non-chain tdnn model for standard mandarin GOP scoring. The training data is from a filtered subset of AISHELL2 and MAGICDATA which contain (relatively) standard Mandarin pron…
Anki Deck to learn Mandarin Chinese pronunciation basics using Pinyin and IPA
Using two stream architecture to implement a classic action recognition method on UCF101 dataset
Tutorial for video classification/ action recognition using 3D CNN/ CNN+RNN on UCF101
Official pytorch Code for CVPR2019 paper "Fast Human Pose Estimation" https://arxiv.org/abs/1811.05419
pytorch implementation of openpose including Hand and Body Pose Estimation.
Convert TensorFlow, Keras, Tensorflow.js and Tflite models to ONNX
Convert mediapipe models to pytorch checkpoints
TexasInstruments / edgeai-yolov5
Forked from ultralytics/yolov5YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite. Forked from https://ultralytics.com/yolov5
Fast and accurate human pose estimation in PyTorch. Contains implementation of "Real-time 2D Multi-Person Pose Estimation on CPU: Lightweight OpenPose" paper.
Distribution-Aware Coordinate Representation for Human Pose Estimation
The project is an official implementation of our CVPR2019 paper "Deep High-Resolution Representation Learning for Human Pose Estimation"