Stars
Stable Diffusion web UI
A list of useful payloads and bypass for Web Application Security and Pentest/CTF
Interact with your documents using the power of GPT, 100% privately, no data leaks
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.
Open-Sora: Democratizing Efficient Video Production for All
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…
Convert Machine Learning Code Between Frameworks
StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation
High-Resolution 3D Human Digitization from A Single Image.
LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath
Text-to-3D & Image-to-3D & Mesh Exportation with NeRF + Diffusion.
Muzic: Music Understanding and Generation with Artificial Intelligence
Snips Python library to extract meaning from text
A lightweight yet powerful audio-to-MIDI converter with pitch bend detection
SC-FEGAN : Face Editing Generative Adversarial Network with User's Sketch and Color (ICCV2019)
Implementation of MusicLM, Google's new SOTA model for music generation using attention networks, in Pytorch
A python application that detects and highlights the heart-rate of an individual (using only their own webcam) in real-time.
Official implementation of CVPR2020 paper "VIBE: Video Inference for Human Body Pose and Shape Estimation"
[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
Plenoxels: Radiance Fields without Neural Networks
EdgeConnect: Structure Guided Image Inpainting using Edge Prediction, ICCV 2019 https://arxiv.org/abs/1901.00212
Scrape Facebook public pages without an API key
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch
The official Python API for ElevenLabs Text to Speech.
Accurate 3D Face Reconstruction with Weakly-Supervised Learning: From Single Image to Image Set (CVPRW 2019)
DECA: Detailed Expression Capture and Animation (SIGGRAPH 2021)