-
Trip
- Shanghai
-
02:10
(UTC +08:00) - [email protected]
- @shylockasr
- https://www.meta-speech.com
-
3D-Speaker Public
Forked from modelscope/3D-Speaker
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
Python Apache License 2.0 Updated Dec 24, 2024 -
Bert-VITS2 Public
Forked from fishaudio/Bert-VITS2
vits2 backbone with multilingual-bert
Python GNU Affero General Public License v3.0 Updated Dec 23, 2024 -
-
returnn-experiments Public
Forked from rwth-i6/returnn-experiments
RWTH Aachen University, Germany (Hermann Ney)
-
-
speechbrain Public
Forked from speechbrain/speechbrain
A PyTorch-based Speech Toolkit
Python Apache License 2.0 Updated Aug 21, 2024 -
-
ASR_Theory Public
Speech recognition theory, papers, and slides (PPT)
-
speechllm Public
Forked from wenet-e2e/west
We Speech Transcript based on LLM, in 300 lines of code.
Python Apache License 2.0 Updated Aug 6, 2024 -
AIF-PyTorch Public
Forked from TeaPoly/AIF-PyTorch
(NOT Official) Implementation of Auto-regressive Integrate-and-Fire (AIF)
Python Updated Dec 11, 2023 -
best-rq-pytorch Public
Forked from lucasnewman/best-rq-pytorch
Implementation of BEST-RQ - a model for self-supervised learning of speech signals using a random projection quantizer, in Pytorch.
Python MIT License Updated Sep 25, 2023 -
whisper-finetune Public
Forked from yfliao/whisper-hakka
Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from huggingface.
Python MIT License Updated Jul 30, 2023 -
cudafst Public
Forked from nvidia-riva/riva-asrlib-decoder
Standalone implementation of the CUDA-accelerated WFST Decoder available in Riva
Python Updated Jun 18, 2023 -
-
speech-to-speech-translation Public
Forked from fengpeng-yue/speech-to-speech-translation
S2ST pseudo-labels
Python MIT License Updated Feb 12, 2023 -
GigaS2S Public
Forked from SpeechTranslation/GigaS2S
S2ST Data
Creative Commons Attribution 4.0 International Updated Jan 22, 2023 -
CTC-OptimizedLoss Public
Forked from TeaPoly/CTC-OptimizedLoss
Computes the MWER (minimum WER) Loss with CTC beam search. Knowledge distillation for CTC loss.
Python Updated Dec 9, 2022 -
-
neurst Public
Forked from bytedance/neurst
Neural end-to-end Speech Translation Toolkit
Python Other Updated Jun 28, 2022 -
-
emoASR Public
Forked from emonosuke/emoASR
End-to-end MOdeling of ASR (Automatic Speech Recognition)
Python Updated Apr 29, 2022 -
KWS_pytorch Public
Forked from hongfeixue/KWS_pytorch
Keyword spotting, Speech wake_up, by pytorch, DNN, CNN, TDNN, DFSMN, LSTM
Python Updated Mar 15, 2022 -
SimilarCharacter Public
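Forked from contr4l/SimilarCharacter
Compares the pronunciation and shape of 6,700 commonly used Chinese characters and outputs lists of similar-sounding and similar-looking characters. # Similar characters
Python MIT License Updated Feb 13, 2022 -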
ksponspeech Public
Forked from sooftware/ksponspeech
Pre-processing KsponSpeech corpus (Korean Speech dataset) provided by AI Hub.
Python MIT License Updated Dec 24, 2021 -
sentencepiece Public
Forked from google/sentencepiece
Unsupervised text tokenizer for Neural Network-based text generation.
C++ Apache License 2.0 Updated Dec 21, 2021 -
SparseSelfAttention Public
Sparse Attention Mechanism, accepted in KSC 2019
-
bert Public
Forked from google-research/bert
TensorFlow code and pre-trained models for BERT
-
Bayesian_TDNN Public
Forked from skhu101/Bayesian_TDNN
This repository contains the Kaldi LF-MMI implementation of the paper "Bayesian Learning of LF-MMI Trained Time Delay Neural Networks for Speech Recognition"
C++ Updated Aug 2, 2021 -
asr-decode-simple Public
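Forked from Ma-Dan/asr-decode
A lightweight speech recognition decoding and inference framework trimmed down from Kaldi; currently implements MFCC+GMM+Viterbi, with no dependency on libraries such as OpenFST or OpenBLAS
C++ Apache License 2.0 Updated Jul 31, 2021 -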