ZY zycv

🐂

Software and Algorithm Engineer

22 followers · 20 following

China
Beijing

Starred repositories

jishengpeng / WavChat

A Survey of Spoken Dialogue Models (60 pages)

262 16 Updated Nov 28, 2024

facebookresearch / fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python 30,975 6,460 Updated Jan 9, 2025

google-research / tuning_playbook

A playbook for systematically maximizing the performance of deep learning models.

27,987 2,307 Updated Jun 18, 2024

TylerYep / torchinfo

View model summaries in PyTorch!

Python 2,689 124 Updated Feb 10, 2025

wenet-e2e / wespeaker

Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit

Python 824 126 Updated Jan 6, 2025

pytorch / audio

Data manipulation and transformation for audio signal processing, powered by PyTorch

Python 2,603 672 Updated Feb 15, 2025

csukuangfj / kaldi_native_io

Python wrapper for Kaldi's native I/O

C++ 27 3 Updated Jan 9, 2025

wenet-e2e / wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

Python 4,303 1,101 Updated Feb 10, 2025

louisfb01 / best_AI_papers_2021

A curated list of the latest breakthroughs in AI (in 2021) by release date with a clear video explanation, link to a more in-depth article, and code.

2,904 238 Updated Oct 18, 2023

zycv / awesome-keyword-spotting

This repository is a curated list of awesome Speech Keyword Spotting (Wake-Up Word Detection).

253 39 Updated May 23, 2022

d2l-ai / d2l-zh

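Dive into Deep Learning (《动手学深度学习》): written for Chinese readers, runnable, and open to discussion. The Chinese and English editions are used for teaching at more than 500 universities in over 70 countries.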

Python 65,770 11,259 Updated Jul 30, 2024

yufan-aslp / AliMeeting

The project is associated with the recently-launched ICASSP 2022 Multi-channel Multi-party Meeting Transcription Challenge (M2MeT) to provide participants with baseline systems for speech recogniti…

Python 117 18 Updated Jun 10, 2022

k2-fsa / icefall

Python 1,011 310 Updated Feb 4, 2025

wenet-e2e / wekws

Production First and Production Ready End-to-End Keyword Spotting Toolkit

Python 502 119 Updated Feb 13, 2025

openspeech-team / openspeech

Open-Source Toolkit for End-to-End Speech Recognition leveraging PyTorch-Lightning and Hydra.

Python 690 115 Updated Oct 23, 2023

mli / paper-reading

Paragraph-by-paragraph close readings of classic and recent deep learning papers

28,210 2,509 Updated Nov 17, 2024

sipeed / Maix-Speech

Maix Speech AI lib, a fast and small speech lib running on embedded devices, including ASR, chat, TTS etc.

Python 331 58 Updated Sep 28, 2022

ZhengkunTian / OpenTransformer

A No-Recurrence Sequence-to-Sequence Model for Speech Recognition

Python 375 66 Updated Jul 21, 2022

k2-fsa / k2

FSA/FST algorithms, differentiable, with PyTorch compatibility.

Cuda 1,165 219 Updated Feb 8, 2025

deepaudio / deepaudio-speaker

neural network based speaker embedder

Python 25 5 Updated Jan 7, 2023

zycv / OpenSpeaker

OpenSpeaker is a completely independent and open source speaker recognition project. It provides the entire process of speaker recognition including multi-platform deployment and model optimization.

C++ 63 13 Updated Feb 16, 2022

wenet-e2e / WenetSpeech

A 10000+ hours dataset for Chinese speech recognition

Shell 517 49 Updated Jul 3, 2023

wq2012 / SpectralCluster

Python re-implementation of the (constrained) spectral clustering algorithms used in Google's speaker diarization papers.

Python 523 72 Updated Sep 25, 2024

microsoft / onnxruntime

ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator

C++ 15,609 3,049 Updated Feb 16, 2025

SpeechColab / GigaSpeech

Large, modern dataset for speech recognition

Shell 662 62 Updated Feb 26, 2024

microsoft / GSL

Guidelines Support Library

C++ 6,299 743 Updated Feb 14, 2025

lynnboy / CppCoreGuidelines-zh-CN

Translation of C++ Core Guidelines [https://github.com/isocpp/CppCoreGuidelines] into Simplified Chinese.

2,248 314 Updated Feb 6, 2025

Alinshans / MyTinySTL

Achieve a tiny STL in C++11

C++ 11,691 3,281 Updated Oct 27, 2024

isocpp / CppCoreGuidelines

The C++ Core Guidelines are a set of tried-and-true guidelines, rules, and best practices about coding in C++

CSS 43,270 5,455 Updated Jan 16, 2025

lixucuhk / ASV-anti-spoofing-with-Res2Net

Implementation of the paper: Replay and Synthetic Speech Detection with Res2Net architecture (ICASSP 2021) https://arxiv.org/abs/2010.15006

Shell 78 15 Updated Oct 21, 2021

Starred topics

speech-processing

keyword-spotting