-
nikans.com
- nikans.com
Stars
📖 A curated list of resources dedicated to talking face.
A tokenizer, text cleaner, and phonemizer for many human languages.
Chrome Extensions Samples
Grapheme to phoneme conversion with deep learning.
A tool for transcribing orthographic text as IPA (International Phonetic Alphabet)
Convert phoneme codes and lexicon formats for English speech synths
Simple text to phones converter for multiple languages
Allosaurus is a pretrained universal phone recognizer for more than 2000 languages
Types and functions that make it a little easier to work with Core ML in Swift.
Convert a series of images to video with audio
a curated list of speech datasets (110+ datasets, 75+ easy to download)
This repository contains the code to replicate the experiments in the paper "Mispronunciation detection using self-supervised speech representations" presented at SLaTE 2023
[Interspeech22]Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility Assessment
A Full Text-Dependent End to End Mispronunciation Detection and Diagnosis with Easy Data Augment Techniques
End-to-End Mispronunciation Detection via wav2vec2.0
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
Robust Speech Recognition via Large-Scale Weak Supervision
edna iOS sdk release libraries and demo project
Simple SwiftUI App. The design was inspired be default Apple Calendar
A small library that adds concurrency to CoreBluetooth APIs.
A wrapper API for CoreBluetooth using Combine Publishers