This is an on-CPU real-time conversational system for two-way speech communication with AI models, utilizing a continuous streaming architecture for fluid conversations with immediate responses and…

Python 31 2 Updated Feb 16, 2025

mybigday / whisper.rn

React Native binding of whisper.cpp.

C 466 31 Updated Dec 2, 2024

DigitalPhonetics / IMS-Toucan

Controllable and fast Text-to-Speech for over 7000 languages!

Python 1,556 176 Updated Nov 7, 2024

oysterlanguage / clearcut

Python 6 Updated Jan 16, 2025

cameronking4 / openai-realtime-api-nextjs

Leverage the OpenAI Realtime API (12-17-2024) with this Next.js 15 starter template featuring shadcn/ui components, tool-calling & localization. Use starter to build Voice AI apps with WebRTC.

TypeScript 278 50 Updated Jan 25, 2025

EvanBacon / expo-router-forms-components

Free UI components I use for building Expo Router apps

TypeScript 282 13 Updated Feb 15, 2025

huggingface / open-r1

Fully open reproduction of DeepSeek-R1

Python 21,921 1,952 Updated Mar 2, 2025

bootphon / phonemizer

Simple text to phones converter for multiple languages

Python 1,338 182 Updated Sep 26, 2024

k2-fsa / sherpa-onnx

Speech-to-text, text-to-speech, speaker diarization, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, HarmonyOS, Raspberry Pi, RISC…

C++ 5,080 570 Updated Mar 3, 2025

remsky / Kokoro-FastAPI

Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model w/CPU ONNX and NVIDIA GPU PyTorch support, handling, and auto-stitching

Python 1,796 222 Updated Mar 2, 2025

assafelovic / gpt-researcher

LLM-based autonomous agent that conducts deep local and web research on any topic and generates a long report with citations.

Python 19,609 2,518 Updated Mar 2, 2025

comfyanonymous / ComfyUI

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.

Python 69,332 7,452 Updated Mar 3, 2025

painebenjamin / taproot

An open source real-time AI inference engine for seamless scaling

Python 14 2 Updated Feb 28, 2025

bheinzerling / bpemb

Pre-trained subword embeddings in 275 languages, based on Byte-Pair Encoding (BPE)

Python 1,200 103 Updated Oct 1, 2024

SanshruthR / CPU_BlazeChat

Generate text and images using the CPU

Python 6 Updated Feb 17, 2025

UbiquitousLearning / mllm

Fast Multimodal LLM on Mobile Devices

C++ 717 84 Updated Mar 3, 2025

rhasspy / piper

A fast, local neural text to speech system

C++ 8,017 597 Updated Oct 21, 2024

madroidmaq / mlx-omni-server

MLX Omni Server is a local inference server powered by Apple's MLX framework, specifically designed for Apple Silicon (M-series) chips. It implements OpenAI-compatible API endpoints, enabling seaml…

Python 261 15 Updated Mar 2, 2025

modelscope / FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python 8,509 881 Updated Feb 25, 2025

huggingface / smollm

Everything about the SmolLM2 and SmolVLM family of models

Python 1,969 111 Updated Feb 20, 2025

ihanif

Stars

Luthey-Schulten-Lab / CraftCells

Pravko-Solutions / FlashLearn

argilla-io / synthetic-data-generator

adrianlyjak / kokoro-onnx-export

lucasnewman / nanospeech

allenai / OLMoE.swift

EvanBacon / expo-ai

Open-LLM-VTuber / Open-LLM-VTuber

NVIDIA / NeMo-text-processing

superagent-ai / reag

asiff00 / On-Device-Speech-to-Speech-Conversational-AI