jingxuan9862

45 results for source starred repositories written in Python

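Dive into Deep Learning (动手学深度学习): written for Chinese readers, runnable, and open to discussion. The Chinese and English editions are used for teaching at more than 500 universities in over 70 countries.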

Python 65,115 11,186 Updated Jul 30, 2024

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Python 53,237 8,863 Updated Aug 14, 2024

Deezer source separation library including pretrained models.

Python 26,209 2,874 Updated Oct 29, 2024

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 20,578 2,583 Updated Jan 7, 2025

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 12,878 2,629 Updated Jan 17, 2025

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translatio…

Python 11,388 1,872 Updated Jan 16, 2025

A PyTorch-based Speech Toolkit

Python 9,198 1,421 Updated Jan 16, 2025

End-to-End Speech Processing Toolkit

Python 8,684 2,206 Updated Jan 15, 2025
Python 7,155 558 Updated Jan 14, 2025

Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications

Python 5,953 1,203 Updated Mar 31, 2024

An Industrial Grade Federated Learning Framework

Python 5,781 1,559 Updated Nov 19, 2024

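Companion code for the book "21 Projects for Mastering Deep Learning: A Detailed Practical Guide Based on TensorFlow" (21个项目玩转深度学习: 基于TensorFlow的实践详解)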

Python 4,532 1,756 Updated Mar 18, 2019

TEN Agent is a conversational AI powered by the TEN, integrating Gemini 2.0 Live, OpenAI Realtime, RTC, and more. It delivers real-time capabilities to see, hear, and speak, while being fully compa…

Python 3,896 412 Updated Jan 14, 2025

A high performance and generic framework for distributed DNN training

Python 3,655 493 Updated Oct 3, 2023

DDSP: Differentiable Digital Signal Processing

Python 2,951 344 Updated Sep 23, 2024

An optimizer that trains as fast as Adam and as good as SGD.

Python 2,909 332 Updated Jul 23, 2023

Paper and implementation of UNet-related model.

Python 2,525 504 Updated May 21, 2020

A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.

Python 2,424 194 Updated Dec 11, 2024

This library provides common speech features for ASR including MFCCs and filterbank energies.

Python 2,382 617 Updated Oct 20, 2021

[IEEE TMI] Official Implementation for UNet++

Python 2,359 546 Updated Jan 11, 2025

The PyTorch-based audio source separation toolkit for researchers

Python 2,310 427 Updated Jan 11, 2025

Code for the ACL 2017 paper "Get To The Point: Summarization with Pointer-Generator Networks"

Python 2,180 808 Updated Jun 16, 2022

Implementation of different kinds of Unet Models for Image Segmentation - Unet, RCNN-Unet, Attention Unet, RCNN-Attention Unet, Nested Unet

Python 1,967 352 Updated Nov 28, 2022

This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.

Python 1,566 321 Updated Sep 25, 2024

Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.

Python 1,499 435 Updated Jan 3, 2025

MT3: Multi-Task Multitrack Music Transcription

Python 1,468 195 Updated Dec 11, 2024

This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.

Python 1,144 420 Updated Jul 25, 2024

In defence of metric learning for speaker recognition

Python 1,077 276 Updated Mar 26, 2024

Implementation of the Wave-U-Net for audio source separation

Python 859 178 Updated Mar 24, 2023