Skip to content

Navigation Menu

Explore
By company size
By use case
By industry
View all solutions
Topics
- AI
- DevOps
- Security
- Software Development
- View all
Explore
- GitHub Sponsors
  Fund open source developers
- The ReadME Project
  GitHub community articles
Repositories
- Enterprise platform
  AI-powered developer platform
Available add-ons
Pricing

Search code, repositories, users, issues, pull requests...

Search

Clear

Search syntax tips

Provide feedback

We read every piece of feedback, and take your input very seriously.

Include my email address so I can be contacted

Saved searches

Use saved searches to filter your results more quickly

Name

Query

To see all available qualifiers, see our documentation.

You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. You switched accounts on another tab or window. Reload to refresh your session.

Dismiss alert

zw76859420 Follow

Overview Repositories 133 Projects 0 Packages 0 Stars 823

More

Overview
Repositories
Projects
Packages
Stars

zw76859420

Follow

🎯

Focusing

Shylock zw76859420

🎯

Focusing

Follow

https://www.meta-speech.com

207 followers · 356 following

Trip
Shanghai
02:10 (UTC +08:00)
[email protected]
@shylockasr
https://www.meta-speech.com

Achievements

Achievements

Block or report zw76859420

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Add an optional note:

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Overview Repositories 133 Projects 0 Packages 0 Stars 823

More

Overview
Repositories
Projects
Packages
Stars

Type All

Select type

All Sources Forks Archived Can be sponsored Mirrors Templates

Language All

Select language

All Python Jupyter Notebook C++ Shell HTML C Cuda

Sort Last updated

Select order

Last updated Name Stars

3D-Speaker Public
Forked from modelscope/3D-Speaker

A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization

Python Apache License 2.0 Updated Dec 24, 2024
Bert-VITS2 Public
Forked from fishaudio/Bert-VITS2

vits2 backbone with multilingual-bert

Python GNU Affero General Public License v3.0 Updated Dec 23, 2024
moshi Public
Forked from kyutai-labs/moshi

Python Apache License 2.0 Updated Sep 19, 2024
returnn-experiments Public
Forked from rwth-i6/returnn-experiments

RWTH Aachen University, Germany(Hermann Ney)

Python 1 Updated Sep 13, 2024
Speech2Unit Public
Forked from nervjack2/Speech2Unit

Python Updated Aug 22, 2024
speechbrain Public
Forked from speechbrain/speechbrain

A PyTorch-based Speech Toolkit

Python Apache License 2.0 Updated Aug 21, 2024
icefall Public
Forked from k2-fsa/icefall

Python Apache License 2.0 Updated Aug 21, 2024
ASR_Theory Public

语音识别理论、论文和PPT

tensorflow keras deeplearning papers kaldi asr k2

590 185 GNU General Public License v3.0 Updated Aug 7, 2024
speechllm Public
Forked from wenet-e2e/west

We Speech Transcript based on LLM, in 300 lines of code.

Python Apache License 2.0 Updated Aug 6, 2024
AIF-PyTorch Public
Forked from TeaPoly/AIF-PyTorch

(NOT Official) Implementation Auto-regressive Integrate-and-Fire (AIF)

Python Updated Dec 11, 2023
best-rq-pytorch Public
Forked from lucasnewman/best-rq-pytorch

Implementation of BEST-RQ - a model for self-supervised learning of speech signals using a random projection quantizer, in Pytorch.

Python MIT License Updated Sep 25, 2023
whisper-finetune Public
Forked from yfliao/whisper-hakka

Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from huggingface.

Python MIT License Updated Jul 30, 2023
cudafst Public
Forked from nvidia-riva/riva-asrlib-decoder

Standalone implementation of the CUDA-accelerated WFST Decoder available in Riva

Python Updated Jun 18, 2023
NeuralSpeech Public
Forked from microsoft/NeuralSpeech

Python 2 MIT License Updated Mar 19, 2023
speech-to-speech-translation Public
Forked from fengpeng-yue/speech-to-speech-translation

S2ST 伪标签

Python MIT License Updated Feb 12, 2023
GigaS2S Public
Forked from SpeechTranslation/GigaS2S

S2ST Data

Creative Commons Attribution 4.0 International Updated Jan 22, 2023
CTC-OptimizedLoss Public
Forked from TeaPoly/CTC-OptimizedLoss

Computes the MWER (minimum WER) Loss with CTC beam search. Knowledge distillation for CTC loss.

Python Updated Dec 9, 2022
whisper Public
Forked from openai/whisper

Jupyter Notebook MIT License Updated Sep 23, 2022
neurst Public
Forked from bytedance/neurst

Neural end-to-end Speech Translation Toolkit

Python Other Updated Jun 28, 2022
gfcc Public
Forked from CoESML/gfcc-speech-kaldi

gfcc features

C++ MIT License Updated Apr 30, 2022
emoASR Public
Forked from emonosuke/emoASR

End-to-end MOdeling of ASR (Automatic Speech Recognition)

Python Updated Apr 29, 2022
KWS_pytorch Public
Forked from hongfeixue/KWS_pytorch

Keyword spotting, Speech wake_up, by pytorch, DNN, CNN, TDNN, DFSMN, LSTM

Python Updated Mar 15, 2022
SimilarCharacter Public
Forked from contr4l/SimilarCharacter

对常用的6700个汉字进行音、形比较，输出音近字、形近字的列表。 # 相近字

Python MIT License Updated Feb 13, 2022
ksponspeech Public
Forked from sooftware/ksponspeech

Pre-processing KsponSpeech corpus (Korean Speech dataset) provided by AI Hub.

Python MIT License Updated Dec 24, 2021
sentencepiece Public
Forked from google/sentencepiece

Unsupervised text tokenizer for Neural Network-based text generation.

C++ Apache License 2.0 Updated Dec 21, 2021
SparseSelfAttention Public

Sparse Attention Mechanism, accepted in KSC 2019

Python 1 Updated Nov 2, 2021
bert Public
Forked from google-research/bert

TensorFlow code and pre-trained models for BERT

Python 1 Apache License 2.0 Updated Sep 11, 2021
Bayesian_TDNN Public
Forked from skhu101/Bayesian_TDNN

This repository contains the Kaldi LF-MMI implementation of the paper "Bayesian Learning of LF-MMI Trained Time Delay Neural Networks for Speech Recognition"

C++ Updated Aug 2, 2021
asr-decode-simple Public
Forked from Ma-Dan/asr-decode

从Kaldi中裁剪的轻量级语音识别解码推理框架，目前实现了MFCC+GMM+Viterbi，不依赖OpenFST、OpenBLAS等库

C++ Apache License 2.0 Updated Jul 31, 2021
snowfall Public
Forked from k2-fsa/snowfall

Python Apache License 2.0 Updated Jun 10, 2021

Previous Next

Footer

© 2025 GitHub, Inc.

Footer navigation

Terms
Privacy
Security
Status
Docs
Contact

You can’t perform that action at this time.