Luis-zhang

Follow

Luis-zhang

Follow

1 follower · 4 following

Lists (5)

Sort

🔮 Future ideas

工具

文本编码

跨模态对齐

音频生成

Starred repositories

descriptinc / audiotools

Object-oriented handling of audio data, with GPU-powered augmentations, and more.

Python 253 45 Updated Jan 2, 2025

shaopengw / Awesome-Music-Generation

Awesome music generation model——MG²

Python 131 11 Updated Jan 21, 2025

Curated-Awesome-Lists / awesome-ai-music-generation

A curated compilation of AI-driven generative music resources and projects. Explore the blend of machine learning algorithms and musical creativity.

269 21 Updated Nov 3, 2023

noteflakes / awesome-music

Awesome Music Projects

1,941 111 Updated Jan 2, 2025

shibing624 / text2vec

text2vec, text to vector. 文本向量表征工具，把文本转化为向量矩阵，实现了Word2Vec、RankBM25、Sentence-BERT、CoSENT等文本表征、文本相似度计算模型，开箱即用。

Python 4,596 405 Updated Jan 2, 2025

huggingface / diffusers

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.

Python 27,344 5,609 Updated Feb 1, 2025

haoheliu / AudioLDM

AudioLDM: Generate speech, sound effects, music and beyond, with text.

Python 2,537 227 Updated Dec 9, 2024

haoheliu / AudioLDM-training-finetuning

AudioLDM training, finetuning, evaluation and inference.

Python 231 45 Updated Dec 13, 2024

LAION-AI / CLAP

Contrastive Language-Audio Pretraining

Python 1,514 149 Updated Nov 21, 2024

declare-lab / TangoFlux

TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching

Jupyter Notebook 603 55 Updated Jan 27, 2025

yuaotian / go-cursor-help

解决Cursor在免费订阅期间出现以下提示的问题: You've reached your trial request limit. / Too many free trial accounts used on this machine. Please upgrade to pro. We have this limit in place to prevent abuse. Please l…

Go 7,785 1,061 Updated Jan 31, 2025

shadowpa0327 / Palu

Code for Palu: Compressing KV-Cache with Low-Rank Projection

Python 64 3 Updated Jan 31, 2025

deepseek-ai / DeepSeek-V2

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

4,581 394 Updated Sep 25, 2024

descriptinc / descript-audio-codec

State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.

Python 1,265 122 Updated Jul 11, 2024

facebookresearch / encodec

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

Python 3,579 313 Updated Jan 4, 2024

facebookresearch / audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…

Jupyter Notebook 21,398 2,221 Updated Jan 15, 2025

NeuralNotW0rk / LoRAW

Flexible LoRA Implementation to use with stable-audio-tools

Python 57 4 Updated Sep 9, 2024

archinetai / audio-diffusion-pytorch

Audio generation using diffusion models, in PyTorch.

Python 2,006 168 Updated Jun 12, 2023

zqevans / audio-diffusion

Python 82 9 Updated May 31, 2023

spotify / pedalboard

🎛 🔊 A Python library for audio.

C++ 5,352 275 Updated Nov 26, 2024

crowsonkb / k-diffusion

Karras et al. (2022) diffusion models for PyTorch

Python 2,379 384 Updated Jan 7, 2025

yukara-ikemiya / friendly-stable-audio-tools

Refactored / updated version of `stable-audio-tools` which is an open-source code for audio/music generative models originally by Stability AI.

Python 160 11 Updated Jul 25, 2024

lucidrains / vector-quantize-pytorch

Vector (and Scalar) Quantization, in Pytorch

Python 2,869 233 Updated Jan 28, 2025

Stability-AI / stable-audio-metrics

Metrics for evaluating music and audio generative models – with a focus on long-form, full-band, and stereo generations.

Python 185 20 Updated Nov 18, 2024

Stability-AI / stable-audio-tools

Generative models for conditional audio generation

Python 2,857 281 Updated Jan 10, 2025

huggingface / transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Python 138,385 27,773 Updated Jan 31, 2025

crlandsc / tiny-audio-diffusion

A repository for generating and training short audio samples with unconditional waveform diffusion on accessible consumer hardware (<2GB VRAM GPU)

Python 156 16 Updated Jun 6, 2024

PatrickJS / awesome-cursorrules

📄 A curated list of awesome .cursorrules files

8,825 583 Updated Jan 29, 2025

AI-Guru / music-generation-research

A straightforward collection of Music Generation research resources.

593 36 Updated Jan 20, 2025

haoheliu / AudioLDM2

Text-to-Audio/Music Generation

Python 2,363 183 Updated Sep 29, 2024

Starred topics

JavaScript