Stars
Turn any glasses into AI-powered smart glasses
Industry leading face manipulation platform
Foundational model for human-like, expressive TTS
High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
The official Python API for ElevenLabs Text to Speech.
An Open Source text-to-speech system built by inverting Whisper.
A fluent API to FFMPEG (http://www.ffmpeg.org)
GeoLite2-City.mmdb.gz CDN files based on Free Open Source CDN jsDelivr!
Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 2, and other large language models.
Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation
Instant voice cloning by MIT and MyShell. Audio foundation model.
A natural language interface for computers
Easy to use open source fast database for search | Good alternative to Elasticsearch now | Drop-in replacement for E in the ELK soon
Native port of Redis for Windows. Redis is an in-memory database that persists on disk. The data model is key-value, but many different kind of values are supported: Strings, Lists, Sets, Sorted Se…
Robust Speech Recognition via Large-Scale Weak Supervision
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
基于原版 frp 内网穿透服务端 frps 的一键安装卸载脚本和 docker 镜像.支持 Linux 服务器和 docker 等多种环境安装部署.
Emscripten: An LLVM-to-WebAssembly Compiler
Experimental Three.js WASM (WIP)
A high-performance, secure, extensible, and OCI-complaint JavaScript runtime for WasmEdge.
An open source, portable, easy to use, readable and flexible TLS library, and reference implementation of the PSA Cryptography API. Releases are on a varying cadence, typically around 3 - 6 months …