Starred repositories
Object-oriented handling of audio data, with GPU-powered augmentations, and more.
Awesome music generation model: MG²
A curated compilation of AI-driven generative music resources and projects. Explore the blend of machine learning algorithms and musical creativity.
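text2vec, text to vector. A text vector representation tool that converts text into vector matrices. It implements text representation and text similarity models including Word2Vec, RankBM25, Sentence-BERT, and CoSENT, and works out of the box.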
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
AudioLDM: Generate speech, sound effects, music and beyond, with text.
AudioLDM training, finetuning, evaluation and inference.
TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching
Fixes the following prompts that Cursor shows during the free subscription period: You've reached your trial request limit. / Too many free trial accounts used on this machine. Please upgrade to pro. We have this limit in place to prevent abuse. Please l…
Code for Palu: Compressing KV-Cache with Low-Rank Projection
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.
State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…
Flexible LoRA Implementation to use with stable-audio-tools
Audio generation using diffusion models, in PyTorch.
Karras et al. (2022) diffusion models for PyTorch
Refactored / updated version of `stable-audio-tools`, the open-source code for audio/music generative models originally by Stability AI.
Vector (and Scalar) Quantization, in PyTorch
Metrics for evaluating music and audio generative models – with a focus on long-form, full-band, and stereo generations.
Generative models for conditional audio generation
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
A repository for generating and training short audio samples with unconditional waveform diffusion on accessible consumer hardware (<2GB VRAM GPU)
📄 A curated list of awesome .cursorrules files
A straightforward collection of Music Generation research resources.