-
EZAI Corp., NTNU SMIL LAB
- Taipei, Taiwan
- https://www.linkedin.com/in/jiun-ting-li/
- https://www.yannyann.com
-
CTC-based-GOP Public
Forked from frank613/CTC-based-GOPThis repo related to the paper "A Framework for Phoneme-Level Pronunciation Assessment Using CTC" for INTERSPEECH2024
Python UpdatedDec 16, 2024 -
ppgs Public
Forked from interactiveaudiolab/ppgsHigh-Fidelity Neural Phonetic Posteriorgrams
Python MIT License UpdatedDec 10, 2024 -
kaldi Public
Forked from kaldi-asr/kaldikaldi-asr/kaldi is the official location of the Kaldi project.
Shell Other UpdatedNov 26, 2024 -
audio-annotator Public
Forked from vell001/audio-annotatorAudio-annotator
JavaScript UpdatedNov 15, 2024 -
syllabificator Public
Forked from guslatho/syllabificatorTool for syllabificating (dividing words into syllables) Dutch or English words. Employs recent high-performance algorithms.
Python GNU General Public License v3.0 UpdatedAug 29, 2024 -
fluency_scorer Public
Forked from tangYang7/fluency_scorerIt's unofficial implementation for speech fluency assessment model
Python UpdatedAug 29, 2024 -
PyToBI Public
Forked from monikaUPF/PyToBIA Toolkit for ToBI Labeling with Python Data Structures
Praat GNU General Public License v3.0 UpdatedAug 18, 2024 -
pykaldi Public
Forked from pykaldi/pykaldiA Python wrapper for Kaldi
Python Apache License 2.0 UpdatedAug 15, 2024 -
asa-with-limited-data Public
Forked from lunsanna/asa-with-limited-dataJupyter Notebook UpdatedJul 16, 2024 -
-
multipa Public
Forked from ctaguchi/multipaUniversal multilingual automatic speech transcription into IPA
Python UpdatedJun 6, 2024 -
-
-
fac-via-ppg Public
Forked from guanlongzhao/fac-via-ppgForeign Accent Conversion by Synthesizing Speech from Phonetic Posteriorgrams (Interspeech'19)
-
local.vad.whisper.s2t Public
Using Whisper (from openai) to decode speech to text
Python Apache License 2.0 UpdatedDec 30, 2022 -
-
-
Stock-Price-Prediction-support Public
Forked from Brianc0927/Stock-Price-PredictionPython UpdatedJul 27, 2022 -
is2021_feature_extractor_v2 Public
Instead of posterior probability of recognized tokens, we use GOP scores as the token's confidence scores
-
larnmvp Public
It an unattended script shell written by bash command for installing Apache 2.4, Nginx 1.17, Varnish 6, Redis 5, PHP 7.4 running on Ubuntu 18.04 LTS
-
w2v_hubert_feats_extractor Public
Retrieve speech representation from Wav2vec and HuBERT pre-trained models
Shell MIT License UpdatedJun 17, 2022 -
is2021_feature_extractor Public
Generating acoustic phonetic features
Python MIT License UpdatedJun 7, 2022 -
local_for_is2021 Public
Some correction to the annotation of tltschool
Shell MIT License UpdatedJun 4, 2022 -
Generating disfluency features from pretrained model
Python MIT License UpdatedJun 2, 2022 -
myprosody Public
Forked from Shahabks/myprosodyA Python library for measuring the acoustic features of speech (simultaneous speech, high entropy) compared to ones of native speech.
Python MIT License UpdatedMay 16, 2022 -
-
HETFORMER Public
Forked from yeliu918/HETFORMERThis is the repository of Heterogeneous Transformer with Sparse Attention forLong-Text Extractive Summarization
Python UpdatedNov 23, 2021 -
espnet Public
Forked from espnet/espnetEnd-to-End Speech Processing Toolkit
Python Apache License 2.0 UpdatedFeb 6, 2021 -
Principles-of-Machine-Learning-Python Public
Forked from datapython/Principles-of-Machine-Learning-PythonPrinciples of Machine Learning Python
Jupyter Notebook UpdatedFeb 5, 2021 -
neural_sp Public
Forked from hirofumi0810/neural_spEnd-to-end ASR/LM implementation with PyTorch
Python Apache License 2.0 UpdatedFeb 3, 2021