53 results for source starred repositories written in Python

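"Dive into Deep Learning" (《动手学深度学习》): written for Chinese readers, runnable, and open to discussion. The Chinese and English editions are used for teaching at more than 500 universities in over 70 countries.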

Python 66,458 11,320 Updated Jul 30, 2024

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Python 53,661 8,904 Updated Aug 14, 2024

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 43,278 5,298 Updated Mar 6, 2025

Making large AI models cheaper, faster and more accessible

Python 40,548 4,478 Updated Mar 6, 2025

Deezer source separation library including pretrained models.

Python 26,477 2,893 Updated Jan 24, 2025

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 20,852 2,602 Updated Mar 4, 2025

SOTA Open Source TTS

Python 19,771 1,528 Updated Mar 3, 2025

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 13,259 2,721 Updated Mar 7, 2025

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translatio…

Python 11,592 1,892 Updated Mar 5, 2025

Multilingual large voice generation model with full-stack support for inference, training, and deployment.

Python 11,565 1,150 Updated Mar 7, 2025

A PyTorch-based Speech Toolkit

Python 9,459 1,446 Updated Mar 6, 2025

End-to-End Speech Processing Toolkit

Python 8,844 2,224 Updated Mar 3, 2025

An open-source implementation of Microsoft's VALL-E X zero-shot TTS model. A demo is available at https://plachtaa.github.io/vallex/

Python 7,813 776 Updated Feb 11, 2024

Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.

Python 7,672 616 Updated Mar 6, 2025

Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications

Python 5,997 1,209 Updated Mar 31, 2024

An Industrial Grade Federated Learning Framework

Python 5,831 1,561 Updated Nov 19, 2024

TEN Agent is a conversational voice AI agent powered by TEN, integrating Deepseek, Gemini, OpenAI, RTC, and hardware like ESP32. It enables realtime AI capabilities like seeing, hearing, and speaki…

Python 4,950 566 Updated Mar 6, 2025

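Companion code for the book 《21个项目玩转深度学习———基于TensorFlow的实践详解》 ("21 Projects for Mastering Deep Learning: A Detailed Practical Guide Based on TensorFlow").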

Python 4,550 1,757 Updated Mar 18, 2019

A high performance and generic framework for distributed DNN training

Python 3,662 494 Updated Oct 3, 2023

DDSP: Differentiable Digital Signal Processing

Python 2,973 347 Updated Sep 23, 2024

An optimizer that trains as fast as Adam and as good as SGD.

Python 2,908 334 Updated Jul 23, 2023

Papers and implementations of UNet-related models.

Python 2,533 506 Updated May 21, 2020

A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.

Python 2,490 201 Updated Feb 12, 2025

[IEEE TMI] Official Implementation for UNet++

Python 2,391 546 Updated Jan 11, 2025

This library provides common speech features for ASR including MFCCs and filterbank energies.

Python 2,390 616 Updated Oct 20, 2021

The PyTorch-based audio source separation toolkit for researchers

Python 2,337 429 Updated Jan 11, 2025

Code for the ACL 2017 paper "Get To The Point: Summarization with Pointer-Generator Networks"

Python 2,186 808 Updated Jun 16, 2022

✨✨VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction

Python 2,135 164 Updated Feb 13, 2025

Implementation of different kinds of Unet Models for Image Segmentation - Unet, RCNN-Unet, Attention Unet, RCNN-Attention Unet, Nested Unet

Python 1,992 355 Updated Nov 28, 2022

This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.

Python 1,570 320 Updated Sep 25, 2024