Stars
Clone a voice in 5 seconds to generate arbitrary speech in real-time
The fundamental package for scientific computing with Python.
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
Nuitka is a Python compiler written in Python. It's fully compatible with Python 2.6, 2.7, 3.4-3.13. You feed it your Python app, it does a lot of clever things, and spits out an executable or exte…
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs
Manipulate audio with a simple and easy high level interface
all kinds of text classification models and more with deep learning
Deep Learning and Reinforcement Learning Library for Scientists and Engineers
Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications
Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network
Sequence to Sequence Learning with Keras
A python package to analyze and compare voices with deep learning
aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)
DeepMind's Tacotron-2 Tensorflow implementation
Solves basic Russian NLP tasks, API for lower level Natasha projects
This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.
Morphological analyzer / inflection engine for Russian and Ukrainian languages.
WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries
The Implementation of FastSpeech based on pytorch.
Implementation of Super Resolution CNN in Keras.
Framework for building complex recurrent neural networks with Keras
Google Drive API Python wrapper library. Maintained fork of PyDrive.
patool is a portable command line archive file manager
Phoneme multilingual(Russian-English) voice cloning based on
VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network