64 results for source starred repositories

TEN Agent is a conversational AI powered by the TEN, integrating Gemini 2.0 Live, OpenAI Realtime, RTC, and more. It delivers real-time capabilities to see, hear, and speak, while being fully compa…

Python 3,895 412 Updated Jan 14, 2025
Python 7,153 558 Updated Jan 14, 2025

This is the code and dataset repo for the Interspeech 2024 paper "Target conversation extraction: Source separation using turn-taking dynamics"

Python 42 4 Updated Oct 4, 2024

The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.

7,086 422 Updated Jul 28, 2024

A novel human-interaction method for real-time speech extraction on headphones.

Python 555 61 Updated Jun 5, 2024

Simple and efficient Python implementation of a series of adaptive filters, including time-domain adaptive filters (LMS, NLMS, RLS, AP, Kalman), nonlinear adaptive filters (Volterra filter, functional link a…

Python 338 98 Updated Nov 29, 2021

MT3: Multi-Task Multitrack Music Transcription

Python 1,468 195 Updated Dec 11, 2024

Unofficial implementation of PercepNet: A Perceptually-Motivated Approach for Low-Complexity, Real-Time Enhancement of Fullband Speech

C++ 335 94 Updated Jan 22, 2023

Perceptual Quality Estimator for speech and audio

C++ 723 128 Updated Aug 2, 2024

A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.

Python 2,424 194 Updated Dec 11, 2024

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 20,577 2,584 Updated Jan 7, 2025

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translatio…

Python 11,386 1,871 Updated Jan 16, 2025

An Industrial Grade Federated Learning Framework

Python 5,782 1,559 Updated Nov 19, 2024

A high performance and generic framework for distributed DNN training

Python 3,655 493 Updated Oct 3, 2023

This library provides common speech features for ASR including MFCCs and filterbank energies.

Python 2,382 617 Updated Oct 20, 2021

SoundNet: Learning Sound Representations from Unlabeled Video. NIPS 2016

Lua 460 93 Updated Oct 7, 2017

Deezer source separation library including pretrained models.

Python 26,206 2,874 Updated Oct 29, 2024

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 12,873 2,626 Updated Jan 16, 2025

Code for the ACL 2017 paper "Get To The Point: Summarization with Pointer-Generator Networks"

Python 2,180 808 Updated Jun 16, 2022

Chinese text normalization for speech processing

Python 645 146 Updated Mar 18, 2023
Jupyter Notebook 109 18 Updated Oct 25, 2021

DDSP: Differentiable Digital Signal Processing

Python 2,950 344 Updated Sep 23, 2024

A tutorial for Speech Enhancement researchers and practitioners. The purpose of this repo is to organize the world’s resources for speech enhancement and make them universally accessible and useful.

MATLAB 735 150 Updated Dec 1, 2020

The PyTorch-based audio source separation toolkit for researchers

Python 2,310 427 Updated Jan 11, 2025

A Unified Speech Enhancement Front-End for Online Dereverberation, Acoustic Echo Cancellation, and Source Separation

MATLAB 113 56 Updated Jun 18, 2022

Code for SuDoRm-Rf networks for efficient audio source separation. SuDoRm-Rf stands for SUccessive DOwnsampling and Resampling of Multi-Resolution Features which enables a more efficient way of sep…

Jupyter Notebook 313 34 Updated Jul 6, 2023

In defence of metric learning for speaker recognition

Python 1,077 276 Updated Mar 26, 2024