- Institute of Acoustics, Chinese Academy of Sciences
- Beijing, China
Stars
Audio Dataset for training CLAP and other models
Get up and running with Llama 3.3, Mistral, Gemma 2, and other large language models.
Code and trained models for "Attentional Feature Fusion"
🔊 Repository for our NAACL-HLT 2019 paper: AudioCaps
A 6-million Audio-Caption Paired Dataset Built with an LLM- and ALM-based Automatic Pipeline
🦇 Encoder of BAT (Learning to Reason about Spatial Sounds with Large Language Models)
PyTorch implementation of "Universal Source Separation with Weakly Labelled Data".
PSELDNets: Pre-trained Neural Networks on Large-scale Synthetic Datasets for Sound Event Localization and Detection
Data generator for sound event localization and detection clips, including 4-ch microphone-array-format signals and first-order-ambisonics-format signals.
Source code for CVPR 2020 paper "Learning to Forget for Meta-Learning"
An elegant \LaTeX\ résumé template. Mainland China mirror: https://gods.coding.net/p/resume/git
Implementation of paper "Towards a Unified View of Parameter-Efficient Transfer Learning" (ICLR 2022)
[ICLR'23] AIM: Adapting Image Models for Efficient Video Action Recognition
Flexible audio loudness meter in Python with implementation of ITU-R BS.1770-4 loudness algorithm
❄️🔥 Visual Prompt Tuning [ECCV 2022] https://arxiv.org/abs/2203.12119
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
Ambiscaper: a tool for automatic dataset generation and annotation of reverberant Ambisonics audio. Originally forked from http://github.com/justinsalamon/scaper
Efficient Training of Audio Transformers with Patchout
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
A library for soundscape synthesis and augmentation