Skip to content
View Jinbo-Hu's full-sized avatar
🎯
Focusing
🎯
Focusing
  • Institute of Acoustics, Chinese Academy of Sciences
  • Beijing, China

Highlights

  • Pro

Block or report Jinbo-Hu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Audio Dataset for training CLAP and other models

Python 654 54 Updated Feb 5, 2024

Get up and running with Llama 3.3, Mistral, Gemma 2, and other large language models.

Go 105,458 8,430 Updated Jan 2, 2025

code and trained models for "Attentional Feature Fusion"

Python 759 95 Updated Jul 23, 2021

VGGSound: A Large-scale Audio-Visual Dataset

Python 297 32 Updated Sep 13, 2021

🔊 Repository for our NAACL-HLT 2019 paper: AudioCaps

Python 147 17 Updated Apr 23, 2024

A 6-million Audio-Caption Paired Dataset Built with a LLMs and ALMs-based Automatic Pipeline

Python 104 2 Updated Dec 13, 2024

🦇 Encoder of BAT (Learning to Reason about Spatial Sounds with Large Language Models)

Python 35 4 Updated Oct 12, 2024

This is the PyTorch implementation of the Universal Source Separation with Weakly labelled Data.

Python 339 18 Updated Sep 1, 2023

PSELDNets: Pre-trained Neural Networks on Large-scale Synthetic Datasets for Sound Event Localization and Detection

Python 6 Updated Dec 20, 2024

Data generator for sound event localization and detection clips, including 4-ch microphone-array-format signals and first-order-ambisonics-format signals.

Python 4 Updated Nov 13, 2024
Python 291 70 Updated Feb 28, 2020

Source code for CVPR 2020 paper "Learning to Forget for Meta-Learning"

Python 34 5 Updated Oct 26, 2020

An elegant \LaTeX\ résumé template. 大陆镜像 https://gods.coding.net/p/resume/git

TeX 9,389 2,621 Updated Mar 15, 2024

Implementation of paper "Towards a Unified View of Parameter-Efficient Transfer Learning" (ICLR 2022)

Python 519 44 Updated Mar 24, 2022

[ICLR'23] AIM: Adapting Image Models for Efficient Video Action Recognition

Python 278 21 Updated Sep 17, 2023

ARCH: Audio Representations benCHmark

Python 39 3 Updated Aug 26, 2024
2 Updated Jun 6, 2024
Python 8 Updated Oct 8, 2023
Jupyter Notebook 32 4 Updated Aug 11, 2024

Flexible audio loudness meter in Python with implementation of ITU-R BS.1770-4 loudness algorithm

Python 664 57 Updated Jul 2, 2024

❄️🔥 Visual Prompt Tuning [ECCV 2022] https://arxiv.org/abs/2203.12119

Python 1,069 93 Updated Sep 2, 2023

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 16,847 1,670 Updated Dec 19, 2024

Ambiscaper: a tool for automatic dataset generation and annotation of reverberant Ambisonics audio. Originally forked from http://github.com/justinsalamon/scaper

Python 21 6 Updated Sep 14, 2018

Efficient Training of Audio Transformers with Patchout

Python 312 51 Updated Jan 12, 2024

This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".

Python 14,135 2,076 Updated Jul 24, 2024

A library for soundscape synthesis and augmentation

Python 384 58 Updated May 4, 2022
Next