UniSpeech - Large Scale Self-Supervised Learning for Speech
-
Updated
Apr 5, 2024 - Python
UniSpeech - Large Scale Self-Supervised Learning for Speech
The dataset of Speech Recognition
This is a repository of neural full-rank spatial covariance analysis with speaker activity (neural FCASA).
Repository for "LLM-based speaker diarization correction: A generalizable approach" paper
A demo to show Speech Diarization (seperating audio of different speaker) and converting them to text using Google Cloud Speech API.
Speech transcription and speech diarization
Template Project For iOS Apps using .onnx Speech Models
Add a description, image, and links to the speech-diarization topic page so that developers can more easily learn about it.
To associate your repository with the speech-diarization topic, visit your repo's landing page and select "manage topics."