Stars
The official implementation of "A Language Modeling Approach to Diacritic-Free Hebrew TTS"
Stable Video Diffusion Training Code and Extensions.
Official implementation of "Dataset Size Recovery from LoRA Weights" paper.
Real-Time Deepfake Detection in the Real-World
AAAI 2024 Papers: Explore a comprehensive collection of innovative research papers presented at one of the premier artificial intelligence conferences. Seamlessly integrate code implementations for…
Official PyTorch Implementation for the "Recovering the Pre-Fine-Tuning Weights of Generative Models" paper (ICML 2024).
Vector (and Scalar) Quantization, in Pytorch
Official repository for "Speaking Style Conversion With Discrete Self-Supervised Units" (EMNLP 2023). https://arxiv.org/abs/2212.09730
AlignProp uses direct reward backpropogation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more sample and compute efficient than reinforcement learning methods…
A Python package to process Pandas Dataframe using multi-processing
State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.
Official Implementation for the "Conffusion: Confidence Intervals for Diffusion Models" paper.
Official implementation of the pipeline presented in I hear your true colors: Image Guided Audio Generation