Starred repositories
Command-line program to download videos from YouTube.com and other video sites
Robust Speech Recognition via Large-Scale Weak Supervision
Clone a voice in 5 seconds to generate arbitrary speech in real-time
openpilot is an operating system for robotics. Currently, it upgrades the driver assistance system on 275+ supported cars.
πΈπ¬ - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.
Mask R-CNN for object detection and instance segmentation on Keras and TensorFlow
Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Magenta: Music and Art Generation with Machine Intelligence
ChatterBot is a machine learning, conversational dialog engine for creating chat bots
π Scalable embedding, reasoning, ranking for images and sentences with CLIP
An open source implementation of CLIP.
TensorFlow-based neural network library
Manipulate audio with a simple and easy high level interface
Implementation of Graph Convolutional Networks in TensorFlow
Numenta Platform for Intelligent Computing is an implementation of Hierarchical Temporal Memory (HTM), a theory of intelligence based strictly on the neuroscience of the neocortex.
OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark
π TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)
A tool for extracting plain text from Wikipedia dumps
Unofficial implementation of Image Super-Resolution via Iterative Refinement by Pytorch
A platform for Reasoning systems (Reinforcement Learning, Contextual Bandits, etc.)
Code for reproducing results in "Glow: Generative Flow with Invertible 1x1 Convolutions"
π€ Build voice-based LLM agents. Modular + open source.
A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial)