-
Nanyang Technological University
-
02:37
(UTC +08:00)
Highlights
- Pro
Lists (3)
Sort Name ascending (A-Z)
Stars
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
Instant voice cloning by MIT and MyShell. Audio foundation model.
idiap / coqui-ai-TTS
Forked from coqui-ai/TTSπΈπ¬ - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Scripts and classes used for testing models for speech and text emotion recognition introduced in my bachelor thesis
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch
Sound analysis/synthesis tools for music applications
This repository contains examples for customers to get started using the Amazon Bedrock Service. This contains examples for all available foundational models
Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when train only in Vox2)
Deep Speaker: an End-to-End Neural Speaker Embedding System.
Bangla cleaned speech corpus, specially developed for Bangla Text to Speech
π A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
Simplified recipes for preparing commonly used speech datasets, and a PyTorch-compatible Python data loader that can perform standard feature computations & data augmentations.
π The Design Checklist for Creative Web Designers and Patient Front-End Developers
π― The most essential list of resources for Front-End beginners (πΊπΈ & π«π·)
π Study guide and introduction to the modern front end stack.
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
The Microsoft Scalable Noisy Speech Dataset (MS-SNSD) is a noisy speech dataset that can scale to arbitrary sizes depending on the number of speakers, noise types, and Speech to Noise Ratio (SNR) lβ¦
A timeline of the latest AI models for audio generation, starting in 2023!
RedHenLab / TalkNet-ASD
Forked from TaoRuijie/TalkNet-ASDACM MM 2021: 'Is Someone Speaking? Exploring Long-term Temporal Features for Audio-visual Active Speaker Detection'
Improved Wave-U-Net implemented in Pytorch
Natural Language Toolkit for Malaysian language, https://malaya.readthedocs.io/
Natural Language Processing Tutorial for Deep Learning Researchers
30 days of JavaScript programming challenge is a step-by-step guide to learn JavaScript programming language in 30 days. This challenge may take more than 100 days, please just follow your own pace.