Skip to content
View hmongouachon's full-sized avatar

Block or report hmongouachon

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
57 stars written in Python
Clear filter

A natural language interface for computers

Python 58,440 5,003 Updated Jan 24, 2025

Interact with your documents using the power of GPT, 100% privately, no data leaks

Python 55,298 7,419 Updated Nov 13, 2024

A generative speech model for daily dialogue.

Python 34,674 3,737 Updated Feb 18, 2025

Make websites accessible for AI agents

Python 32,326 3,314 Updated Feb 24, 2025

Instant voice cloning by MIT and MyShell. Audio foundation model.

Python 31,050 3,123 Updated Jan 7, 2025

aider is AI pair programming in your terminal

Python 27,854 2,533 Updated Feb 24, 2025

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 9,850 1,331 Updated Feb 22, 2025

Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.

Python 7,539 604 Updated Feb 24, 2025

tiny vision language model

Python 7,447 577 Updated Feb 23, 2025

A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.

Python 6,023 483 Updated Feb 22, 2025

Agent Zero AI framework

Python 5,883 1,306 Updated Feb 19, 2025

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

Python 5,483 498 Updated Aug 10, 2024

Inference and training library for high-quality TTS models.

Python 5,039 527 Updated Dec 10, 2024

Open Source framework for voice and multimodal conversational AI

Python 4,875 553 Updated Feb 24, 2025

Speech To Speech: an effort for an open-sourced and modular GPT4-o

Python 3,769 411 Updated Dec 4, 2024

Your agent in your terminal, equipped with local tools: writes code, uses the terminal, browses the web, vision.

Python 3,291 229 Updated Feb 24, 2025

Converts text to speech in realtime

Python 2,578 244 Updated Feb 15, 2025

Talk to any LLM with hands-free voice interaction, voice interruption, and Live2D taking face running locally across platforms

Python 2,368 237 Updated Feb 24, 2025

Python Crypto Bot (PyCryptoBot)

Python 2,007 744 Updated May 21, 2024

Follow along with my AI Agents Masterclass videos! All of the code I create and use in this series on YouTube will be here for you to use and even build on top of!

Python 1,653 815 Updated Jan 14, 2025

A modular voice assistant application for experimenting with state-of-the-art transcription, response generation, and text-to-speech models. Supports OpenAI, Groq, Elevanlabs, CartesiaAI, and Deepg…

Python 912 172 Updated Nov 5, 2024

Desktop AI Assistant powered by o1, o3-mini, GPT-4, GPT-4 Vision, Gemini, Claude, Llama 3, DeepSeek, Bielik, DALL-E, chat, vision, voice control, image generation and analysis, agents, command exec…

Python 878 168 Updated Feb 3, 2025
Python 772 49 Updated Sep 22, 2022

Local voice chatbot for engaging conversations, powered by Ollama, Hugging Face Transformers, and Coqui TTS Toolkit

Python 749 48 Updated Aug 12, 2024

AlwaysReddy is a LLM voice assistant that is always just a hotkey away.

Python 718 79 Updated Feb 1, 2025

An OpenAI API compatible text to speech server using Coqui AI's xtts_v2 and/or piper tts as the backend.

Python 697 95 Updated Feb 2, 2025

Sharing early versions of Ada, a personal AI Assistant built on OpenAIs Realtime API

Python 689 210 Updated Oct 20, 2024

Local SRT/LLM/TTS Voicechat

Python 623 66 Updated Oct 12, 2024

Free, high-quality text-to-speech API endpoint to replace OpenAI, Azure, or ElevenLabs

Python 517 74 Updated Jan 29, 2025
Next