GinChow

Jin Zhou GinChow

Zhejiang University;

31 followers · 33 following

Achievements

Lists (12)

Sort

Stars

qiuqiangkong / audioset_tagging_cnn

Python 1,393 259 Updated Jul 25, 2024

Fraunhofer-IIS / ODAQ

Python 36 3 Updated Oct 9, 2024

google / visqol

Perceptual Quality Estimator for speech and audio

C++ 723 128 Updated Aug 2, 2024

zfang399 / AlignNet

AlignNet: A Unifying Approach to Audio-Visual Alignment (WACV 2020)

Python 32 4 Updated Jan 10, 2021

deepinsight / insightface

State-of-the-art 2D and 3D Face Analysis Project

Python 24,052 5,478 Updated Dec 5, 2024

yukara-ikemiya / minimal-musicgen-for-developers

[PyTorch] Minimal codebase for MusicGen models

Python 46 Updated Jan 7, 2025

benlubas / molten-nvim

A neovim plugin for interactively running code with the jupyter kernel. Fork of magma-nvim with improvements in image rendering, performance, and more

Python 687 36 Updated Jan 15, 2025

PyAV-Org / PyAV

Pythonic bindings for FFmpeg's libraries.

Cython 2,615 375 Updated Jan 16, 2025

naver-ai / rewas

Official PyTorch implementation of ReWaS (AAAI'25) "Read, Watch and Scream! Sound Generation from Text and Video"

Python 30 Updated Dec 13, 2024

lochenchou / MOSNet

Implementation of "MOSNet: Deep Learning based Objective Assessment for Voice Conversion"

Python 349 64 Updated Jul 21, 2024

v-iashin / Synchformer

Source code for "Synchformer: Efficient Synchronization from Sparse Cues" (ICASSP 2024)

Python 39 5 Updated Apr 25, 2024

naver-ai / tc-clip

[ECCV 2024] Official PyTorch implementation of TC-CLIP "Leveraging Temporal Contextualization for Video Action Recognition"

Python 44 7 Updated Sep 26, 2024

ispamm / Stable-V2A

Stable-V2A: Synthesis of Synchronized Sound Effect with Temporal and Semantic Controls

10 Updated Dec 20, 2024

YuanGongND / ast

Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".

Jupyter Notebook 1,199 221 Updated May 21, 2023

hkchengrex / MMAudio

[arXiv 2024] Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis

Python 969 103 Updated Jan 14, 2025

youngsheen / SimVQ

SimVQ: Addressing Representation Collapse in Vector Quantized Models with One Linear Layer

Python 191 5 Updated Dec 29, 2024

Tencent / HunyuanVideo

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python 7,472 581 Updated Jan 17, 2025

JishengBai / AudioSetCaps

A 6-million Audio-Caption Paired Dataset Built with a LLMs and ALMs-based Automatic Pipeline

Python 111 2 Updated Dec 13, 2024

haoheliu / versatile_audio_super_resolution

Versatile audio super resolution (any -> 48kHz) with AudioSR.

Python 1,258 130 Updated Jan 9, 2025

FunAudioLLM / CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 9,688 938 Updated Jan 15, 2025

SWivid / F5-TTS

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 8,991 1,194 Updated Jan 15, 2025

audioset / ontology

The Audio Set Ontology aims to provide a comprehensive set of categories to describe sound events.

655 152 Updated May 21, 2018

turpaultn / DESED

Repo associated to the DESED dataset, download and creation of data

Python 131 15 Updated Jul 16, 2024

yorukot / superfile

Pretty fancy and modern terminal file manager

Go 8,721 200 Updated Jan 13, 2025

Standard-Intelligence / hertz-dev

first base model for full-duplex conversational audio

Python 1,685 113 Updated Jan 5, 2025

snap-research / GenAU

Jupyter Notebook 19 Updated Dec 24, 2024

cdjkim / audiocaps

🔊 Repository for our NAACL-HLT 2019 paper: AudioCaps

Python 148 17 Updated Apr 23, 2024

robertanto / Real-Time-Sound-Event-Detection

This repository contains the python implementation of a Sound Event Detection systems working in real time.

Python 53 8 Updated Oct 10, 2022

mathieulagrange / dcaseFadEmbedding

Python 1 Updated Aug 18, 2024

DCASE2024-Task7-Sound-Scene-Synthesis / fadtk

Forked from microsoft/fadtk

A simple pytorch library for Fréchet Audio Distance (FAD) calculation

Python 4 1 Updated Dec 5, 2024

Jin Zhou GinChow

Lists (12)

AI-related

🚀AI-Tools

Algo

book

Data

DataCrawler

🚀 My stack

neovim

quant

🦀 rust

🔉 Speech_related

webdev

Stars