This repository contains examples to help customers get started with the Amazon Bedrock service, covering all available foundation models.

Jupyter Notebook 903 405 Updated Mar 7, 2025

TaoRuijie / ECAPA-TDNN

Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when trained only on Vox2)

Python 652 118 Updated Apr 11, 2024

philipperemy / deep-speaker

Deep Speaker: an End-to-End Neural Speaker Embedding System.

Python 918 241 Updated Apr 13, 2024

Bangla-Language-Processing / Bangla-Speech-Corpora

Bangla cleaned speech corpus, specially developed for Bangla Text to Speech

3 3 Updated Feb 8, 2020

coqui-ai / open-speech-corpora

💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

1,315 142 Updated Jun 6, 2024

salesforce / speech-datasets

Simplified recipes for preparing commonly used speech datasets, and a PyTorch-compatible Python data loader that can perform standard feature computations & data augmentations.

Python 15 4 Updated Jun 12, 2023

GovTechSG / sgds-govtech-react

React components for SGDS

TypeScript 16 8 Updated Feb 23, 2025

thedaviddias / Front-End-Design-Checklist

💎 The Design Checklist for Creative Web Designers and Patient Front-End Developers

5,046 387 Updated Dec 10, 2024

thedaviddias / Resources-Front-End-Beginner

💯 The most essential list of resources for Front-End beginners (🇺🇸 & 🇫🇷)

4,098 409 Updated Dec 21, 2024

grab / front-end-guide

📚 Study guide and introduction to the modern front end stack.

JavaScript 15,200 1,124 Updated Jun 12, 2023

UniversalDependencies / UD_English-EWT

English data

Python 205 42 Updated Mar 5, 2025

mlabonne / llm-course

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Jupyter Notebook 47,688 5,070 Updated Jan 22, 2025

sigsep / bsseval

audio source separation evaluation metrics

Python 29 11 Updated Aug 27, 2019

google / visqol

Perceptual Quality Estimator for speech and audio

C++ 740 130 Updated Aug 2, 2024

microsoft / MS-SNSD

The Microsoft Scalable Noisy Speech Dataset (MS-SNSD) is a noisy speech dataset that can scale to arbitrary sizes depending on the number of speakers, noise types, and Speech to Noise Ratio (SNR) l…

HTML 512 149 Updated Jul 1, 2024

archinetai / audio-ai-timeline

A timeline of the latest AI models for audio generation, starting in 2023!

1,899 71 Updated Jan 4, 2024

RedHenLab / TalkNet-ASD

Forked from TaoRuijie/TalkNet-ASD

ACM MM 2021: 'Is Someone Speaking? Exploring Long-term Temporal Features for Audio-visual Active Speaker Detection'

Python 1 Updated Oct 23, 2023

JavaScript 6 1 Updated Nov 3, 2022

Clarence ClarenceTKX
