Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…

Python 21,219 2,186 Updated Nov 11, 2024

ivy-llc / ivy

Convert Machine Learning Code Between Frameworks

Python 14,022 5,726 Updated Dec 18, 2024

guoyww / AnimateDiff

Official implementation of AnimateDiff.

Python 10,773 877 Updated Jul 31, 2024

cumulo-autumn / StreamDiffusion

StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation

Python 9,823 715 Updated Dec 4, 2024

facebookresearch / pifuhd

High-Resolution 3D Human Digitization from A Single Image.

Python 9,576 1,463 Updated Aug 19, 2024

nlpxucan / WizardLM

LLMs built upon Evol-Instruct: WizardLM, WizardCoder, WizardMath

Python 9,300 722 Updated Aug 5, 2024

ashawkey / stable-dreamfusion

Text-to-3D & Image-to-3D & Mesh Exportation with NeRF + Diffusion.

Python 8,385 736 Updated Dec 10, 2023

microsoft / muzic

Muzic: Music Understanding and Generation with Artificial Intelligence

Python 4,606 454 Updated Oct 12, 2024

google-deepmind / alphageometry

Python 4,231 478 Updated Oct 25, 2024

snipsco / snips-nlu

Snips Python library to extract meaning from text

Python 3,900 513 Updated May 22, 2023

spotify / basic-pitch

A lightweight yet powerful audio-to-MIDI converter with pitch bend detection

Python 3,565 282 Updated Nov 4, 2024

run-youngjoo / SC-FEGAN

SC-FEGAN: Face Editing Generative Adversarial Network with User's Sketch and Color (ICCV 2019)

Python 3,350 530 Updated May 20, 2024

lucidrains / musiclm-pytorch

Implementation of MusicLM, Google's new SOTA model for music generation using attention networks, in PyTorch

Python 3,193 258 Updated Sep 6, 2023

thearn / webcam-pulse-detector

A Python application that detects and highlights an individual's heart rate in real time, using only their own webcam.

Python 3,162 596 Updated Jul 27, 2024

mkocabas / VIBE

Official implementation of CVPR2020 paper "VIBE: Video Inference for Human Body Pose and Shape Estimation"

Python 2,915 552 Updated Mar 24, 2023

DAMO-NLP-SG / Video-LLaMA

[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding

Python 2,854 265 Updated Jun 4, 2024

sxyu / svox2

Plenoxels: Radiance Fields without Neural Networks

Python 2,837 361 Updated Jun 29, 2023

knazeri / edge-connect

EdgeConnect: Structure Guided Image Inpainting using Edge Prediction, ICCV 2019 https://arxiv.org/abs/1901.00212

Python 2,534 532 Updated Feb 3, 2024

kevinzg / facebook-scraper

Scrape Facebook public pages without an API key

Python 2,498 636 Updated Jun 22, 2024

lucidrains / audiolm-pytorch

Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in PyTorch

Python 2,465 266 Updated Nov 8, 2024

elevenlabs / elevenlabs-python

The official Python API for ElevenLabs Text to Speech.

Python 2,284 271 Updated Dec 18, 2024

microsoft / Deep3DFaceReconstruction

Accurate 3D Face Reconstruction with Weakly-Supervised Learning: From Single Image to Image Set (CVPRW 2019)

Python 2,240 447 Updated Jan 4, 2024

yfeng95 / DECA

DECA: Detailed Expression Capture and Animation (SIGGRAPH 2021)

Python 2,184 433 Updated Jul 23, 2023

qianqianwang68 / omnimotion

Python 2,162 126 Updated Jun 11, 2024

mostafamdy

Stars

AUTOMATIC1111 / stable-diffusion-webui

swisskyrepo / PayloadsAllTheThings

zylon-ai / private-gpt

lm-sys / FastChat

TencentARC / GFPGAN

hpcaitech / Open-Sora

facebookresearch / audiocraft