Skip to content
View francklinson's full-sized avatar
  • China.Hangzhou

Block or report francklinson

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
65 results for source starred repositories written in Python
Clear filter

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Python 138,383 27,771 Updated Jan 31, 2025

Robust Speech Recognition via Large-Scale Weak Supervision

Python 75,382 9,013 Updated Jan 4, 2025

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 39,632 4,450 Updated Jan 18, 2025

TensorFlow code and pre-trained models for BERT

Python 38,583 9,656 Updated Jul 23, 2024

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 37,273 4,649 Updated Aug 16, 2024

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python 30,868 6,448 Updated Jan 9, 2025

GUI for a Vocal Remover that uses Deep Neural Networks.

Python 19,243 1,423 Updated Dec 9, 2024

SOTA Open Source TTS

Python 18,741 1,416 Updated Jan 26, 2025

Faster Whisper transcription with CTranslate2

Python 13,783 1,153 Updated Jan 1, 2025

Python Implementation of Reinforcement Learning: An Introduction

Python 13,780 4,857 Updated Aug 9, 2024

ChatGLM3 series: Open Bilingual Chat LLMs | 开源双语对话语言模型

Python 13,614 1,592 Updated Jan 13, 2025

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Python 13,612 1,479 Updated Jan 27, 2025

A PyTorch-based Speech Toolkit

Python 9,286 1,425 Updated Jan 31, 2025

Simple Reinforcement learning tutorials, 莫烦Python 中文AI教学

Python 9,035 5,025 Updated Mar 31, 2024

End-to-End Speech Processing Toolkit

Python 8,730 2,212 Updated Jan 30, 2025

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Python 8,397 640 Updated Jan 23, 2025

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python 7,871 821 Updated Jan 24, 2025

CodeGeeX2: A More Powerful Multilingual Code Generation Model

Python 7,645 532 Updated Jul 10, 2024

EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

Python 7,616 650 Updated Aug 13, 2024

Chinese version of GPT2 training code, using BERT tokenizer.

Python 7,505 1,706 Updated Apr 25, 2024

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Python 7,079 1,293 Updated Dec 6, 2023

Production First and Production Ready End-to-End Speech Recognition Toolkit

Python 4,283 1,099 Updated Jan 10, 2025

😝 TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)

Python 3,882 816 Updated Jul 5, 2024

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

Python 3,579 313 Updated Jan 4, 2024

pytorch tutorial for beginners

Python 3,003 1,088 Updated Feb 12, 2022

LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.

Python 2,780 187 Updated Nov 14, 2024

An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.

Python 2,138 151 Updated Jan 27, 2025

Evolutionary algorithm toolbox and framework with high performance for Python

Python 2,048 728 Updated Jan 17, 2025

Real-time end-to-end singing voice conversion system based on DDSP (Differentiable Digital Signal Processing)

Python 1,983 252 Updated Jan 13, 2025

faster_whisper GUI with PySide6

Python 1,967 117 Updated Dec 8, 2024
Next