Lists (25)
Sort Name ascending (A-Z)
ASR
Audio Datasets
Audio Embeddings
CV
Denoisers
Diarization
DiffSinger
Dropbox
Image generation
image inhance
ML
Multiprocessing
Music classification
Music enchance
Music generation
Music Loop
Music tagging
Photo animation
Silence detection
Study
SVC
SVS
Text summarization
Time stretch
vocal detection
Starred repositories
The Single Note Database, containing 30883 samples of 11 instruments from 4 datasets.
Python library for working with Music Information Retrieval datasets
🔥 InfiniteYou: Flexible Photo Recrafting While Preserving Your Identity
Dataset and code of GTSinger(NeurIPS 2024 Spotlight): A Global Multi-Technique Singing Corpus with Realistic Music Scores for All Singing Tasks
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
Daily tracking of awesome audio papers, including music generation, zero-shot tts, asr, audio generation
AI Audio Datasets (AI-ADS) 🎵, including Speech, Music, and Sound Effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio a…
a list of demo websites for automatic music generation research
InspireMusic: A Unified Framework for Music, Song, Audio Generation.
YuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open
Dropbox Uploader is a BASH script which can be used to upload, download, list or delete files from Dropbox, an online file sharing, synchronization and backup service.
Dropbox Uploader is a Python script which can be used to upload, download, restore, list or delete files from Dropbox, an online file sharing, synchronization and backup service.
A Python package for easy multiprocessing, but faster than multiprocessing
Di♪♪Rhythm: Blazingly Fast and Embarrassingly Simple End-to-End Full-Length Song Generation with Latent Diffusion
Open-source Windows and Office activator featuring HWID, Ohook, TSforge, KMS38, and Online KMS activation methods, along with advanced troubleshooting.
A simple library for Fréchet Audio Distance (FAD) calculation
Implementation of Prompt-Singer: Controllable Singing-Voice-Synthesis with Natural Language Prompt (NAACL'24).
PyTorch Implementation of TCSinger(EMNLP 2024): Zero-Shot Singing Voice Synthesis with Style Transfer and Multi-Level Style Control
The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.