Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Python 8,428 645 Updated Feb 3, 2025

Chris10M / Lip2Speech

A pipeline that reads lips and generates speech for the content read, i.e., lip-to-speech synthesis.

Python 79 19 Updated Nov 25, 2021

wavmark / wavmark

AI-based Audio Watermarking Tool

Python 246 32 Updated Jan 7, 2024

smile-struggler / CN-Celeb3_collector

JavaScript 5 1 Updated Jul 5, 2024

facebookresearch / av_hubert

A self-supervised learning framework for audio-visual speech

Python 869 138 Updated Dec 7, 2023

mpc001 / auto_avsr

Auto-AVSR: Lip-Reading Sentences Project

Python 298 48 Updated Jan 8, 2025

mylxsw / aidea

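AIdea is an all-in-one app that supports GPT as well as Chinese large language models such as Tongyi Qianwen and Wenxin Yiyan (ERNIE Bot), along with Stable Diffusion text-to-image, image-to-image, SDXL 1.0, super-resolution, and image colorization.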

Dart 6,614 990 Updated Feb 5, 2025

mpc001 / Lipreading_using_Temporal_Convolutional_Networks

ICASSP'22 Training Strategies for Improved Lip-Reading; ICASSP'21 Towards Practical Lipreading with Distilled and Efficient Models; ICASSP'20 Lipreading using Temporal Convolutional Networks

Python 406 103 Updated May 18, 2023

snakers4 / open_stt

Open STT

Python 789 81 Updated Mar 11, 2022

chenjiandongx / bili-spider

📺 A crawler for video information across the whole Bilibili site

Python 636 187 Updated Feb 17, 2019

microsoft / NeuralSpeech

Python 1,408 181 Updated Feb 11, 2024

ddlBoJack / Speech-Resources

Labs, companies, resources, internships, and more in the speech field; recommendations and self-nominations are welcome

542 68 Updated Nov 13, 2024

ruanyf / weekly

Tech Enthusiast Weekly, published every Friday

51,941 3,096 Updated Feb 7, 2025

Wechat-ggGitHub / Awesome-GitHub-Repo

A curated collection of high-quality, interesting open-source projects on GitHub.

15,590 1,742 Updated Feb 5, 2025

Ryuk17 / Ryuk17

5 Updated Feb 7, 2025

NVIDIA / trt-samples-for-hackathon-cn

Simple samples for TensorRT programming

Python 1,569 345 Updated Dec 18, 2024

speechandlanguageprocessing / ICASSP2022-Depression

Automatic Depression Detection: A GRU/BiLSTM-based Model and an Emotional Audio-Textual Corpus

Python 148 36 Updated Jul 10, 2023

AmruthPillai / Reactive-Resume

A one-of-a-kind resume builder that keeps your privacy in mind. Completely secure, customizable, portable, open-source and free forever. Try it out today!

TypeScript 29,371 2,950 Updated Feb 7, 2025

papers-we-love / papers-we-love

Papers from the computer science community to read and discuss.

Shell 90,614 5,836 Updated Nov 8, 2024

XiaoMi / kaldi-onnx

Kaldi model converter to ONNX

Python 237 57 Updated Jan 27, 2023

ToughmanL

Stars

jmaczan / asr-dysarthria

VoiceBank-NTPU-TW / VoiceBank-2023

EthanLifeGreat / Qwen2-local-api

TaoRuijie / TalkNet-ASD

pplonski / my_ml_service

mit-han-lab / temporal-shift-module

open-mmlab / mmaction2

myshell-ai / OpenVoice

archinetai / surgeon-pytorch

TencentGameMate / chinese_speech_pretrain

open-mmlab / Amphion