pengyizhou

10 followers · 3 following

https://orcid.org/0000-0002-5718-570X

Stars

PacktPublishing / R-Bioinformatics-Cookbook

R Bioinformatics Cookbook, published by Packt

HTML 110 71 Updated Jan 30, 2023

meta-llama / llama-cookbook

Welcome to the Llama Cookbook! This is your go-to guide for building with Llama: getting started with inference, fine-tuning, and RAG. We also show you how to solve end-to-end problems using Llama mode…

Jupyter Notebook 16,237 2,334 Updated Feb 19, 2025

ntu-nail / CE7455

Jupyter Notebook 36 8 Updated Feb 19, 2025

jishengpeng / WavChat

A Survey of Spoken Dialogue Models (60 pages)

263 16 Updated Nov 28, 2024

gpt-omni / mini-omni2

Towards Open-source GPT-4o with Vision, Speech and Duplex Capabilities.

Python 1,625 181 Updated Jan 16, 2025

thunlp / duplex-model

TypeScript 33 4 Updated Aug 17, 2024

ddlBoJack / Awesome-Speech-Language-Model

Paper, Code and Resources for Speech Language Model and End2End Speech Dialogue System.

156 13 Updated Nov 10, 2024

THUDM / GLM-4-Voice

GLM-4-Voice | End-to-end Chinese-English spoken dialogue model

Python 2,669 216 Updated Dec 5, 2024

YUCHEN005 / STAR-Adapt

Code for paper "Self-Taught Recognizer: Toward Unsupervised Adaptation for Speech Foundation Models"

Python 239 3 Updated May 24, 2024

modelscope / ClearerVoice-Studio

An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.

Python 2,247 163 Updated Feb 14, 2025

2noise / ChatTTS

A generative speech model for daily dialogue.

Python 34,506 3,724 Updated Feb 18, 2025

ufal / whisper_streaming

Whisper realtime streaming for long speech-to-text transcription and translation

Python 2,471 303 Updated Jan 7, 2025

kyutai-labs / sphn

python bindings for symphonia/opus - read various audio formats from python and write opus files

Rust 30 4 Updated Dec 22, 2024

Emrys365 / fairseq

Forked from facebookresearch/fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python 1 Updated Jul 8, 2024

kyutai-labs / moshi

Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.

Python 7,511 600 Updated Feb 19, 2025

amphionspace / SD-Eval

[NeurIPS 2024] SD-Eval: A Benchmark Dataset for Spoken Dialogue Understanding Beyond Words

Python 48 1 Updated Jun 25, 2024

FunAudioLLM / CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 10,807 1,056 Updated Feb 16, 2025

lencx / Noi

🚀 Power Your World with AI - Explore, Extend, Empower.

JavaScript 7,141 530 Updated Feb 10, 2025

QwenLM / Qwen-Audio

The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.

Python 1,597 115 Updated Jul 5, 2024

nullscc / nullscc.github.io

HTML 8 Updated Dec 11, 2023

chrisballinger / OpenCallBlock

iOS CallKit blocking of NPA-NXX number prefix spam

Swift 76 23 Updated Dec 1, 2018

RayWangQvQ / BiliBiliToolPro

Automated task tool for Bilibili, supporting multiple deployment methods such as Docker, Qinglong, and k8s. Gentle enough for sensitive skin.

C# 6,840 1,820 Updated Feb 16, 2025

lencx / ChatGPT

🔮 ChatGPT Desktop Application (Mac, Windows and Linux)

Rust 53,627 6,060 Updated Aug 29, 2024

NVIDIA / NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 13,151 2,683 Updated Feb 19, 2025

modelscope / FunClip

Open-source, accurate and easy-to-use video speech recognition and clipping tool, with integrated LLM-based AI clipping.

Python 4,166 467 Updated Aug 22, 2024

modelscope / FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python 8,244 858 Updated Feb 18, 2025

suno-ai / bark

🔊 Text-Prompted Generative Audio Model

Jupyter Notebook 36,984 4,357 Updated Aug 19, 2024

asteroid-team / torch-audiomentations

Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.

Python 999 91 Updated Jan 15, 2025

iver56 / audiomentations

A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.

Python 1,951 194 Updated Feb 18, 2025

sczhou / CodeFormer

[NeurIPS 2022] Towards Robust Blind Face Restoration with Codebook Lookup Transformer

Python 16,491 3,449 Updated Oct 9, 2024
