Skip to content
View Luis-zhang's full-sized avatar

Block or report Luis-zhang

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Object-oriented handling of audio data, with GPU-powered augmentations, and more.

Python 253 45 Updated Jan 2, 2025

Awesome music generation model——MG²

Python 131 11 Updated Jan 21, 2025

A curated compilation of AI-driven generative music resources and projects. Explore the blend of machine learning algorithms and musical creativity.

269 21 Updated Nov 3, 2023

Awesome Music Projects

1,941 111 Updated Jan 2, 2025

text2vec, text to vector. 文本向量表征工具,把文本转化为向量矩阵,实现了Word2Vec、RankBM25、Sentence-BERT、CoSENT等文本表征、文本相似度计算模型,开箱即用。

Python 4,596 405 Updated Jan 2, 2025

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.

Python 27,344 5,609 Updated Feb 1, 2025

AudioLDM: Generate speech, sound effects, music and beyond, with text.

Python 2,537 227 Updated Dec 9, 2024

AudioLDM training, finetuning, evaluation and inference.

Python 231 45 Updated Dec 13, 2024

Contrastive Language-Audio Pretraining

Python 1,514 149 Updated Nov 21, 2024

TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching

Jupyter Notebook 603 55 Updated Jan 27, 2025

解决Cursor在免费订阅期间出现以下提示的问题: You've reached your trial request limit. / Too many free trial accounts used on this machine. Please upgrade to pro. We have this limit in place to prevent abuse. Please l…

Go 7,785 1,061 Updated Jan 31, 2025

Code for Palu: Compressing KV-Cache with Low-Rank Projection

Python 64 3 Updated Jan 31, 2025

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

4,581 394 Updated Sep 25, 2024

State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.

Python 1,265 122 Updated Jul 11, 2024

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

Python 3,579 313 Updated Jan 4, 2024

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…

Jupyter Notebook 21,398 2,221 Updated Jan 15, 2025

Flexible LoRA Implementation to use with stable-audio-tools

Python 57 4 Updated Sep 9, 2024

Audio generation using diffusion models, in PyTorch.

Python 2,006 168 Updated Jun 12, 2023
Python 82 9 Updated May 31, 2023

🎛 🔊 A Python library for audio.

C++ 5,352 275 Updated Nov 26, 2024

Karras et al. (2022) diffusion models for PyTorch

Python 2,379 384 Updated Jan 7, 2025

Refactored / updated version of `stable-audio-tools` which is an open-source code for audio/music generative models originally by Stability AI.

Python 160 11 Updated Jul 25, 2024

Vector (and Scalar) Quantization, in Pytorch

Python 2,869 233 Updated Jan 28, 2025

Metrics for evaluating music and audio generative models – with a focus on long-form, full-band, and stereo generations.

Python 185 20 Updated Nov 18, 2024

Generative models for conditional audio generation

Python 2,857 281 Updated Jan 10, 2025

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Python 138,385 27,773 Updated Jan 31, 2025

A repository for generating and training short audio samples with unconditional waveform diffusion on accessible consumer hardware (<2GB VRAM GPU)

Python 156 16 Updated Jun 6, 2024

📄 A curated list of awesome .cursorrules files

8,825 583 Updated Jan 29, 2025

A straightforward collection of Music Generation research resources.

593 36 Updated Jan 20, 2025

Text-to-Audio/Music Generation

Python 2,363 183 Updated Sep 29, 2024
Next