Visualizing Cell Structures with Minecraft

Jupyter Notebook 44 2 Updated Feb 8, 2025

Integrate LLMs in any pipeline - fit/predict pattern, JSON-driven flows, and built-in concurrency support.

Python 570 39 Updated Feb 21, 2025

Build datasets using natural language

Python 401 45 Updated Feb 21, 2025
Python 11 Updated Feb 23, 2025

A simple, hackable text-to-speech system in PyTorch and MLX

Python 124 11 Updated Feb 23, 2025
Swift 227 25 Updated Feb 15, 2025

AI chat app built with Expo Router

TypeScript 505 79 Updated Feb 13, 2025

Talk to any LLM with hands-free voice interaction, voice interruption, and a Live2D talking face, running locally across platforms

Python 2,480 250 Updated Mar 2, 2025

NeMo text processing for ASR and TTS

Python 313 102 Updated Feb 28, 2025

Reasoning Augmented Generation

Python 746 50 Updated Feb 12, 2025

This is an on-CPU real-time conversational system for two-way speech communication with AI models, utilizing a continuous streaming architecture for fluid conversations with immediate responses and…

Python 31 2 Updated Feb 16, 2025

React Native binding of whisper.cpp.

C 466 31 Updated Dec 2, 2024

Controllable and fast Text-to-Speech for over 7000 languages!

Python 1,556 176 Updated Nov 7, 2024
Python 6 Updated Jan 16, 2025

Leverage the OpenAI Realtime API (12-17-2024) with this Next.js 15 starter template featuring shadcn/ui components, tool-calling & localization. Use starter to build Voice AI apps with WebRTC.

TypeScript 278 50 Updated Jan 25, 2025

Free UI components I use for building Expo Router apps

TypeScript 282 13 Updated Feb 15, 2025

Fully open reproduction of DeepSeek-R1

Python 21,921 1,952 Updated Mar 2, 2025

Simple text to phones converter for multiple languages

Python 1,338 182 Updated Sep 26, 2024

Speech-to-text, text-to-speech, speaker diarization, and VAD using next-gen Kaldi with onnxruntime, without an Internet connection. Supports embedded systems, Android, iOS, HarmonyOS, Raspberry Pi, RISC…

C++ 5,080 570 Updated Mar 3, 2025

Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model w/CPU ONNX and NVIDIA GPU PyTorch support, handling, and auto-stitching

Python 1,796 222 Updated Mar 2, 2025

LLM based autonomous agent that conducts deep local and web research on any topic and generates a long report with citations.

Python 19,609 2,518 Updated Mar 2, 2025

The most powerful and modular diffusion model GUI, API, and backend with a graph/nodes interface.

Python 69,332 7,452 Updated Mar 3, 2025

An open source real-time AI inference engine for seamless scaling

Python 14 2 Updated Feb 28, 2025

Pre-trained subword embeddings in 275 languages, based on Byte-Pair Encoding (BPE)

Python 1,200 103 Updated Oct 1, 2024

Generate text and images using the CPU

Python 6 Updated Feb 17, 2025

Fast Multimodal LLM on Mobile Devices

C++ 717 84 Updated Mar 3, 2025

A fast, local neural text to speech system

C++ 8,017 597 Updated Oct 21, 2024

MLX Omni Server is a local inference server powered by Apple's MLX framework, specifically designed for Apple Silicon (M-series) chips. It implements OpenAI-compatible API endpoints, enabling seaml…

Python 261 15 Updated Mar 2, 2025

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python 8,509 881 Updated Feb 25, 2025

Everything about the SmolLM2 and SmolVLM family of models

Python 1,969 111 Updated Feb 20, 2025