Skip to content
View nikich340's full-sized avatar
  • Russia, Khabarovsk

Organizations

@CM-LG-MSM8226 @CM-LG-F70N

Block or report nikich340

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
60 results for source starred repositories written in Python
Clear filter

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Python 137,566 27,563 Updated Jan 14, 2025

Robust Speech Recognition via Large-Scale Weak Supervision

Python 74,274 8,871 Updated Jan 4, 2025

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Python 53,210 8,858 Updated Aug 14, 2024

Deepfakes Software For All

Python 52,865 13,275 Updated Nov 19, 2024

OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.

Python 37,183 3,254 Updated Aug 17, 2024

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 36,806 4,546 Updated Aug 16, 2024

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 20,570 2,581 Updated Jan 7, 2025

SOTA Open Source TTS

Python 18,316 1,370 Updated Jan 12, 2025

A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python

Python 16,786 2,373 Updated Jan 10, 2025

Faster Whisper transcription with CTranslate2

Python 13,447 1,134 Updated Jan 1, 2025

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translatio…

Python 11,380 1,870 Updated Jan 13, 2025

TensorFlow CNN for fast style transfer ⚡🖥🎨🖼

Python 10,935 2,600 Updated Jul 16, 2023

Manipulate audio with a simple and easy high level interface

Python 9,098 1,062 Updated Jul 25, 2024

so-vits-svc fork with realtime support, improved interface and more features.

Python 8,854 1,182 Updated Jan 13, 2025

End-to-End Speech Processing Toolkit

Python 8,677 2,203 Updated Jan 14, 2025

Code for the paper "Jukebox: A Generative Model for Music"

Python 7,891 1,432 Updated Jun 19, 2024

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Python 7,030 1,284 Updated Dec 6, 2023

Real-Time High-Resolution Background Matting

Python 6,906 954 Updated Jun 19, 2024

ECCV18 Workshops - Enhanced SRGAN. Champion PIRM Challenge on Perceptual Super-Resolution. The training codes are in BasicSR.

Python 6,120 1,091 Updated Oct 19, 2022

Single Image to 3D using Cross-Domain Diffusion for 3D Generation

Python 4,914 397 Updated Jan 6, 2025

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Python 4,740 456 Updated Dec 26, 2024

😝 TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)

Python 3,875 814 Updated Jul 5, 2024

An unofficial PyTorch implementation of the audio LM VALL-E

Python 2,983 418 Updated May 10, 2023

A python package to analyze and compare voices with deep learning

Python 2,824 434 Updated Oct 12, 2023

This is an open source project (formerly named Listen, Attend and Spell - PyTorch Implementation) for end-to-end ASR implemented with Pytorch, the well known deep learning toolkit.

Python 1,194 317 Updated Dec 19, 2020

🤖💬 Transformer TTS: Implementation of a non-autoregressive Transformer based neural network for text to speech.

Python 1,137 225 Updated May 3, 2024

General Speech Restoration

Python 1,066 132 Updated May 31, 2024

AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss

Python 1,024 210 Updated Oct 23, 2024

Large-scale pretrained models for goal-directed dialog

Python 862 112 Updated Dec 10, 2023

A Generative Flow for Text-to-Speech via Monotonic Alignment Search

Python 672 152 Updated Jul 12, 2022
Next