tuteng0915
  • Tsinghua U
  • Beijing


MT3: Multi-Task Multitrack Music Transcription

Python 1,481 194 Updated Dec 11, 2024
Python 2 Updated Jan 19, 2025

Towards Modality Generalization: A Benchmark and Prospective Analysis

Python 19 1 Updated Feb 9, 2025

TensorZero creates a feedback loop for optimizing LLM applications — turning production data into smarter, faster, and cheaper models.

Rust 2,523 142 Updated Feb 13, 2025

LP-MusicCaps: LLM-Based Pseudo Music Captioning [ISMIR23]

Python 301 36 Updated Apr 8, 2024

[ICLR2023] Discrete Contrastive Diffusion for Cross-Modal Music and Image Generation (CDCD).

Python 156 10 Updated Apr 5, 2023

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…

Jupyter Notebook 21,469 2,234 Updated Jan 15, 2025

ImageBind: One Embedding Space to Bind Them All

Python 8,505 789 Updated Jul 31, 2024

MU-LLaMA: Music Understanding Large Language Model

Python 261 19 Updated Mar 25, 2024

Evaluation functions for music/audio information retrieval/signal processing algorithms.

Python 623 117 Updated Feb 10, 2025

A curated list of Video to Audio Generation

16 1 Updated Oct 17, 2024

Video2Music: Suitable Music Generation from Videos using an Affective Multimodal Transformer model

Python 156 21 Updated Jul 30, 2024

Manually annotated chord data set of US pop songs and Popular Music Collection of RWC Music Database

Python 86 13 Updated Apr 9, 2013

SD-Trainer: LoRA and Dreambooth training scripts and GUI, built on kohya-ss's trainer, for diffusion models.

Python 4,907 598 Updated Feb 8, 2025

A large-scale dataset of caption-annotated MIDI files.

Python 55 3 Updated Jul 23, 2024
Jupyter Notebook 160 10 Updated Jul 5, 2024

This repository contains metadata for the WavCaps dataset and code for downstream tasks.

Python 218 12 Updated Jul 25, 2024

The Song Describer dataset is an evaluation dataset made of ~1.1k captions for 706 permissively licensed music recordings.

Jupyter Notebook 147 5 Updated Dec 22, 2023

Stable Diffusion web UI

Python 147,683 27,605 Updated Feb 10, 2025
Python 17 1 Updated Jan 16, 2025

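Extract WeChat chat history and export it to HTML, Word, and Excel documents for permanent storage; analyze the chat history to generate an annual chat report; and use the chat data to train a personal AI chat assistant.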

Python 37,152 3,831 Updated Jan 2, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 40,314 4,939 Updated Feb 12, 2025

A curated list of awesome 3d generation papers

1,115 55 Updated Mar 9, 2023
Python 2 Updated Nov 24, 2023

Responsive resume/CV website using HTML, CSS, and JavaScript

HTML 296 174 Updated Mar 31, 2024

A modern static resume template and theme. Powered by Jekyll and GitHub pages.

HTML 2,126 1,420 Updated Jun 15, 2024

[ICCV 2023] Online Clustered Codebook

Python 157 10 Updated Sep 19, 2024

Longformer: The Long-Document Transformer

Python 2,082 277 Updated Feb 8, 2023