Self-Supervised Speech Pre-training and Representation Learning Toolkit
-
Updated
Nov 16, 2024 - Python
Self-Supervised Speech Pre-training and Representation Learning Toolkit
speech to text with self-supervised learning based on wav2vec 2.0 framework
A live speech recognition using Facebooks wav2vec 2.0 model.
PyTorch implementation of "data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language" from Meta AI
Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf
Training scripts for Speech-To-Text models for Ukrainian language
Wave2vec 2.0 Recognize pipeline
Wav2vec resources and models for Brazilian Portuguese
Speech to Text with self-supervised learning based on wav2vec 2.0 framework using Hugging Face's Transformer
Simple Python library, distributed via binary wheels with few direct dependencies, for easily using wav2vec 2.0 models for speech recognition
Speeech Recognition for Indic languages.
Fine-tuning wav2vec2 to for Pathological Speech Processing
A library version of wav2vec 2.0 framework for Automatic Speech Recognition task.
Building a speaker identification & verification pipeline for Vietnamese voices 😪
A repo to make installation and training of a wav2vec model easier
This repository contains scripts to prune Wav2vec2 using a neuroevolution-based method. More details about this method can be found in the paper Compressing Wav2vec2 for Embedded Applications.
No api-keys | local | llama3.1 For language studying and live translation
Deep audio modeling
Add a description, image, and links to the wav2vec topic page so that developers can more easily learn about it.
To associate your repository with the wav2vec topic, visit your repo's landing page and select "manage topics."