Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous …

Python 17,278 839 Updated Dec 15, 2024

rhasspy / piper

A fast, local neural text to speech system

C++ 6,971 510 Updated Oct 21, 2024

SYSTRAN / faster-whisper

Faster Whisper transcription with CTranslate2

Python 12,972 1,085 Updated Dec 12, 2024

CrazyBoyM / llama3-Chinese-chat

Llama3、Llama3.1 中文仓库（随书籍撰写中... 各种网友及厂商微调、魔改版本有趣权重 & 训练、推理、评测、部署教程视频 & 文档）

Python 4,075 337 Updated Sep 16, 2024

Kiteretsu77 / APISR

APISR: Anime Production Inspired Real-World Anime Super-Resolution (CVPR 2024)

Python 906 61 Updated Jun 28, 2024

Zejun-Yang / AniPortrait

AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation

Python 4,723 586 Updated Jul 2, 2024

upscayl / upscayl

🆙 Upscayl - #1 Free and Open Source AI Image Upscaler for Linux, MacOS and Windows.

TypeScript 31,754 1,464 Updated Dec 15, 2024

Mozilla-Ocho / llamafile

Distribute and run LLMs with a single file.

C++ 20,884 1,068 Updated Dec 14, 2024

google / gemma.cpp

lightweight, standalone C++ inference engine for Google's Gemma models.

C++ 6,046 517 Updated Dec 13, 2024

gmn / nanotts

Improved SVOX PicoTTS speech synthesizer

C 103 24 Updated Apr 25, 2021

ggerganov / llama.cpp

LLM inference in C/C++

C++ 69,277 9,968 Updated Dec 15, 2024

huakunyang / SummerAsr

SummerAsr 是一个基于C++的可独立编译且几乎没有额外依赖库的本地中文语音识别器。 Summer Asr is a Chinese automatic speech recognize project written with C++ that can be easily built standalone without any depencency.

C++ 85 10 Updated Dec 14, 2024

pytorch / executorch

On-device AI across mobile, embedded and edge for PyTorch

C++ 2,291 391 Updated Dec 16, 2024

k2-fsa / sherpa

Speech-to-text server framework with next-gen Kaldi

C++ 570 110 Updated Dec 13, 2024

LargeWorldModel / LWM

Large World Model -- Modeling Text and Video with Millions Context

Python 7,180 556 Updated Oct 19, 2024

thijsvanloef / palworld-server-docker

A Docker Container to easily run a Palworld dedicated server.

Shell 2,445 298 Updated Dec 16, 2024

guoqincode / Open-AnimateAnyone

Unofficial Implementation of Animate Anyone

Python 2,965 239 Updated Jul 9, 2024

open-mmlab / Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Jupyter Notebook 7,922 597 Updated Nov 30, 2024

linexjlin / GPTs

leaked prompts of GPTs

28,973 3,925 Updated Sep 27, 2024

HumanAIGC / AnimateAnyone

Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation

14,532 974 Updated Jul 26, 2024

PlayVoice / vits_chinese

Best practice TTS based on BERT and VITS with some Natural Speech Features Of Microsoft; Support ONNX streaming out!

Python 1,168 167 Updated Feb 5, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

cqz mega-cqz

Block or report mega-cqz

Stars

grantjenks / py-tree-sitter-languages

hacksider / Deep-Live-Cam

hiyouga / LLaMA-Factory

openai / whisper

huggingface / text-generation-inference

naklecha / llama3-from-scratch

NaruseMioShirakana / DragonianVoice

SiTH-Diffusion / SiTH

2noise / ChatTTS

khoj-ai / khoj