Stars
Easily train a good VC model with voice data <= 10 mins!
Project Page for "LISA: Reasoning Segmentation via Large Language Model"
๐ธ Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading
๐ค Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
๐ธ๐ฌ - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Research code for the paper "Fine-tuning wav2vec2 for speaker recognition" found at https://arxiv.org/abs/2109.15053
[CVPR 2022--Oral] Restormer: Efficient Transformer for High-Resolution Image Restoration. SOTA for motion deblurring, image deraining, denoising (Gaussian/real data), and defocus deblurring.
Official PyTorch implementation of "AP-BSN: Self-Supervised Denoising for Real-World Images via Asymmetric PD and Blind-Spot Network" in CVPR 2022.
Open-Unmix - Music Source Separation for PyTorch
This repository has implementation for "Neural Voice Cloning With Few Samples"
๐ฆ LaMa Image Inpainting, Resolution-robust Large Mask Inpainting with Fourier Convolutions, WACV 2022
An anomaly detection library comprising state-of-the-art algorithms and features such as experiment management, hyper-parameter optimization, and edge inference.
(ICLR 2022 Spotlight) Official PyTorch implementation of "How Do Vision Transformers Work?"
PyTorch implementation of the TIP2017 paper "Beyond a Gaussian Denoiser: Residual Learning of Deep CNN for Image Denoising"
End-to-End Unpaired Image Denoising with Conditional Adversarial Networks (AAAI-20)
EDCNN: Edge enhancement-based Densely Connected Network with Compound Loss for Low-Dose CT Denoising
OpenMMLab Semantic Segmentation Toolbox and Benchmark.
Automated Machine Learning with scikit-learn
Variational Autoencoder in the mel-spectrogram domain for one-shot audio synthesis
Video lstm auto encoder built with pytorch. https://arxiv.org/pdf/1502.04681.pdf
A Collection of Variational Autoencoders (VAE) in PyTorch.