jd3655

Vector Ventures jd3655

14 followers · 61 following

Achievements

Stars

jjunak-yun / FLowHigh_code

Code for the paper "FLowHigh: Towards efficient and high-quality audio super-resolution with single-step flow matching"

Python 26 3 Updated Dec 2, 2024

Standard-Intelligence / hertz-dev

first base model for full-duplex conversational audio

Python 1,649 106 Updated Nov 12, 2024

edwko / OuteTTS

Interface for OuteTTS models.

Python 762 59 Updated Dec 14, 2024

youngsheen / GPST

[ACL 2024] Generative Pre-Trained Speech Language Model with Efficient Hierarchical Transformer

Python 40 2 Updated Nov 1, 2024

BUTSpeechFIT / TS-ASR-Whisper

18 Updated Sep 19, 2024

THUDM / GLM-4-Voice

GLM-4-Voice | End-to-end Chinese-English speech dialogue model

Python 2,449 198 Updated Dec 5, 2024

gpt-omni / mini-omni2

Towards Open-source GPT-4o with Vision, Speech and Duplex Capabilities.

Python 1,673 199 Updated Nov 6, 2024

Mihaiii / backtrack_sampler

An easy-to-understand framework for LLM samplers that rewind and revise generated tokens

Python 114 8 Updated Oct 29, 2024

rhymes-ai / Aria

Codebase for Aria - an Open Multimodal Native MoE

Jupyter Notebook 901 74 Updated Dec 12, 2024

SWivid / F5-TTS

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 7,981 1,009 Updated Dec 14, 2024

xjdr-alt / entropix

Entropy Based Sampling and Parallel CoT Decoding

Python 3,165 319 Updated Nov 13, 2024

xiaoxue1117 / speech-mamba-public

Python 10 1 Updated Nov 26, 2024

yl4579 / StyleTTS-ZS

StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusion

162 10 Updated Sep 27, 2024

WangHelin1997 / SSR-Speech

SSR-Speech: Towards Stable, Safe and Robust Zero-shot Speech Editing and Synthesis

Python 102 10 Updated Nov 1, 2024

thuhcsi / VoxInstruct

VoxInstruct: Expressive Human Instruction-to-Speech Generation with Unified Multilingual Codec Language Modelling

Python 49 3 Updated Nov 9, 2024

FireRedTeam / FireRedTTS

An Open-Sourced LLM-empowered Foundation TTS System

Python 492 35 Updated Oct 17, 2024

OpenT2S / LlamaVoice

LlamaVoice is a llama-based large voice generation model, providing inference and training ability.

Python 224 12 Updated Aug 26, 2024

ex3ndr / supervoice-hybrid

My hybrid TTS network that combines VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one

Jupyter Notebook 27 2 Updated Aug 5, 2024

lucidrains / e2-tts-pytorch

Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in Pytorch

Python 387 36 Updated Dec 3, 2024

adelacvg / detail_tts

All generative models in one for a better TTS model

Python 65 8 Updated Sep 8, 2024

winddori2002 / DEX-TTS

DEX-TTS: Diffusion-based EXpressive TTS with Style Modeling on Time Variability

Python 95 8 Updated Nov 1, 2024

zjlww / ardit-web

HTML 25 1 Updated Aug 2, 2024

ex3ndr / supervoice-vall-e-2

VALL-E 2 reproduction

Jupyter Notebook 102 14 Updated Jul 14, 2024

ditto-tts / ditto-tts.github.io

Official Demo Page for DiTTo-TTS: Efficient and Scalable Zero-Shot Text-to-Speech with Diffusion Transformer

HTML 31 1 Updated Aug 21, 2024

fixie-ai / ultravox

A fast multimodal LLM for real-time voice

Python 1,587 107 Updated Dec 12, 2024

Plachtaa / FAcodec

Training code for FAcodec presented in NaturalSpeech3

Python 183 18 Updated Aug 26, 2024

ex3ndr / supervoice-flow

SpeechFlow neural network implementation

Jupyter Notebook 18 Updated Aug 8, 2024

ex3ndr / supervoice-enhance

Supervoice diffusion enhance

Jupyter Notebook 25 Updated Jul 15, 2024

zhenye234 / FlashSpeech

FlashSpeech: Efficient Zero-Shot Speech Synthesis

Python 101 5 Updated Sep 20, 2024

Alpha-VLLM / Lumina-T2X

Lumina-T2X is a unified framework for Text to Any Modality Generation

Python 2,108 88 Updated Aug 6, 2024
