47 stars written in Python

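"Dive into Deep Learning" (《动手学深度学习》): written for Chinese readers, runnable, and open to discussion. The Chinese and English editions are used for teaching at more than 500 universities in over 70 countries.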

Python 64,673 11,140 Updated Jul 30, 2024

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Python 53,070 8,834 Updated Aug 14, 2024

Deezer source separation library including pretrained models.

Python 26,110 2,866 Updated Oct 29, 2024

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 20,480 2,576 Updated Dec 15, 2024

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 12,555 2,576 Updated Dec 29, 2024

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translatio…

Python 11,325 1,866 Updated Dec 27, 2024

A PyTorch-based Speech Toolkit

Python 9,108 1,413 Updated Dec 28, 2024

End-to-End Speech Processing Toolkit

Python 8,627 2,200 Updated Dec 28, 2024

Code for the paper Hybrid Spectrogram and Waveform Source Separation

Python 8,517 1,089 Updated Apr 24, 2024
Python 7,059 552 Updated Dec 20, 2024

Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications

Python 5,936 1,202 Updated Mar 31, 2024

An Industrial Grade Federated Learning Framework

Python 5,767 1,558 Updated Nov 19, 2024

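Companion code for the book "21 Projects to Master Deep Learning: A Detailed Practical Guide Based on TensorFlow" (《21个项目玩转深度学习———基于TensorFlow的实践详解》)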

Python 4,532 1,756 Updated Mar 18, 2019

TEN Agent is a conversational AI powered by the TEN, integrating Gemini 2.0 Live, OpenAI Realtime, RTC, and more. It delivers real-time capabilities to see, hear, and speak, while being fully compa…

Python 3,784 372 Updated Dec 28, 2024

A high performance and generic framework for distributed DNN training

Python 3,641 491 Updated Oct 3, 2023

DDSP: Differentiable Digital Signal Processing

Python 2,931 345 Updated Sep 23, 2024

An optimizer that trains as fast as Adam and as good as SGD.

Python 2,908 330 Updated Jul 23, 2023

Paper and implementation of UNet-related model.

Python 2,521 504 Updated May 21, 2020

A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.

Python 2,398 193 Updated Dec 11, 2024

This library provides common speech features for ASR including MFCCs and filterbank energies.

Python 2,381 617 Updated Oct 20, 2021

[IEEE TMI] Official Implementation for UNet++

Python 2,345 543 Updated Nov 15, 2023

The PyTorch-based audio source separation toolkit for researchers

Python 2,300 424 Updated Jul 19, 2024

Code for the ACL 2017 paper "Get To The Point: Summarization with Pointer-Generator Networks"

Python 2,180 811 Updated Jun 16, 2022

Implementation of different kinds of Unet Models for Image Segmentation - Unet, RCNN-Unet, Attention Unet, RCNN-Attention Unet, Nested Unet

Python 1,962 349 Updated Nov 28, 2022

This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.

Python 1,566 320 Updated Sep 25, 2024

Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.

Python 1,489 434 Updated Dec 8, 2024

MT3: Multi-Task Multitrack Music Transcription

Python 1,457 195 Updated Dec 11, 2024

This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.

Python 1,141 415 Updated Jul 25, 2024

In defence of metric learning for speaker recognition

Python 1,074 274 Updated Mar 26, 2024