Stars
📝 A text file containing 479k English words for all your dictionary/word-based projects e.g: auto-completion / autosuggestion
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
A SDK to using the Realtime API with Microcontrollers like the ESP32
A simple, open source bilingual translation extension & Greasemonkey script (一个简约、开源的 双语对照翻译扩展 & 油猴脚本)
Running Interactive Brokers gateway on Raspberry 4B and Debian 64-bit
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
Docker image with IB Gateway/TWS and IBC
The next-generation text editor, powered by AI that writes with you, not for you.
real time face swap and one-click video deepfake with only a single image
StockBot powered by Groq: Lightning Fast AI Chatbot that Responds With Live Interactive Stock Charts, Financials, News, Screeners, and More. Powered by Llama3-70b on Groq, Vercel AI SDK, and Tradin…
ThetaGang is an IBKR bot for collecting money
Linux命令大全搜索工具,内容包含Linux命令手册、详解、学习、搜集。https://git.io/linux
ML-powered speech recognition directly in your browser
esp32 based device, mainly used for voice chat with large language models
Live Image Classification on ESP32-CAM and TFT with MobileNet v1 from Edge Impulse (TinyML)
Official Implementation of EnCLAP (ICASSP 2024)
🔊 Repository for our NAACL-HLT 2019 paper: AudioCaps
The subtitles and translations are generated in real-time and displayed as pop-ups.
Build multi-modal Agents with memory, knowledge, tools and reasoning. Chat with them using a beautiful Agent UI.
Get up and running with Llama 3.3, Mistral, Gemma 2, and other large language models.
A toolkit for developing and comparing reinforcement learning algorithms.
#1 Locally hosted web application that allows you to perform various operations on PDF files
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)