-
DeepSpeaker-pytorch Public
Forked from qqueing/DeepSpeaker-pytorchSpeaker embedding(verification and recognition) using Pytorch
Python MIT License UpdatedJul 28, 2024 -
FAE-CV Public
This repo contains the reicpe to assemble a corpus for Foreign Accented English using the crowdsourced corpus Common Voice which contains (optional) accent labels.
-
-
websocket-bridge Public
Forked from nvidia-riva/websocket-bridgeWebsockets <-> Riva proxy service. Audiocodes compatible.
JavaScript MIT License UpdatedSep 2, 2022 -
build-an-avatar-with-ASR-TTS-Transformer-Omniverse-Audio2Face Public
Forked from metaiintw/build-an-avatar-with-ASR-TTS-Transformer-Omniverse-Audio2FaceJupyter Notebook UpdatedJun 9, 2022 -
FishBoardMix Public
The FishBoardMix corpus is designed to explore Speaker-Age estimation technology.
-
speechbrain Public
Forked from speechbrain/speechbrainA PyTorch-based Speech Toolkit
Python Apache License 2.0 UpdatedSep 12, 2021 -
Speech-Emotion-Recognition Public
Forked from Renovamen/Speech-Emotion-RecognitionSpeech emotion recognition implemented in Keras (LSTM, CNN, SVM, MLP) | 语音情感识别
Python MIT License UpdatedMar 18, 2021 -
mcr2 Public
Forked from ryanchankh/mcr2Official Implementation of Learning Diverse and Discriminative Representations via the Principle of Maximal Coding Rate Reduction (2020)
Python UpdatedFeb 18, 2021 -
kaldi Public
Forked from kaldi-asr/kaldiThis is the official location of the Kaldi project.
Shell Other UpdatedOct 30, 2019 -
pytorch-kaldi Public
Forked from mravanelli/pytorch-kaldipytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding a…
Python UpdatedSep 10, 2019 -
speaker-embedding-with-phonetic-information Public
Forked from mycrazycracy/speaker-embedding-with-phonetic-informationThe code for the Interspeech paper "Speaker Embedding Extraction with Phonetic Information"
-
TasNet Public
Forked from kaituoxu/TasNetA PyTorch implementation of Time-domain Audio Separation Network (TasNet) with Permutation Invariant Training (PIT) for speech separation.
Python UpdatedJan 27, 2019 -
XXX-blockchain-starter-kit Public
Created for toolchain: https://console.ng.bluemix.net/devops/toolchains/b981acec-b692-43ec-b168-0e02169b75b2?env_id=ibm%3Ayp%3Aus-south
Shell Apache License 2.0 UpdatedApr 26, 2018