This repository aims at providing efficient CNNs for Audio Tagging. We provide AudioSet pre-trained models ready for downstream training and extraction of audio embeddings.

Python 262 45 Updated Nov 20, 2024

coqui-ai / TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 37,912 4,730 Updated Aug 16, 2024

m-bain / whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Python 14,044 1,522 Updated Feb 19, 2025

speechbrain / speechbrain

A PyTorch-based Speech Toolkit

Python 9,387 1,437 Updated Feb 13, 2025

maum-ai / voicefilter

Unofficial PyTorch implementation of Google AI's VoiceFilter system

Python 1,119 228 Updated Jul 25, 2024

DigitalPhonetics / IMS-Toucan

Controllable and fast Text-to-Speech for over 7000 languages!

Python 1,547 175 Updated Nov 7, 2024

UKPLab / EasyNMT

Easy to use, state-of-the-art Neural Machine Translation for 100+ languages

Python 1,210 121 Updated Dec 21, 2023

modelscope / KAN-TTS

KAN-TTS is a speech-synthesis training framework, please try the demos we have posted at https://modelscope.cn/models?page=1&tasks=text-to-speech

Python 500 85 Updated Dec 28, 2023

modelscope / modelscope

ModelScope: bring the notion of Model-as-a-Service to life.

Python 7,413 763 Updated Feb 21, 2025

ymoslem / OpenNMT-Tutorial

Neural Machine Translation (NMT) tutorial. Data preprocessing, model training, evaluation, and deployment.

Jupyter Notebook 157 30 Updated Apr 17, 2024

facebookresearch / demucs

Code for the paper Hybrid Spectrogram and Waveform Source Separation

Python 8,650 1,123 Updated Apr 24, 2024

xserrat / docker-facebook-demucs

Dockerized Facebook Demucs library to make it easy its execution

Makefile 153 44 Updated Nov 24, 2024

fhamborg / news-please

news-please - an integrated web crawler and information extractor for news that just works

Python 2,157 431 Updated Oct 14, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Redbeard-himalaya

Block or report Redbeard-himalaya

Stars

myshell-ai / OpenVoice

eugenesiow / super-image

serengil / deepface

fastai / fastbook

fastai / course22

deepfakes / faceswap

qiuqiangkong / audioset_tagging_cnn

fschmid56 / EfficientAT