Lists (4)
Sort Name ascending (A-Z)
Stars
Build effective agents using Model Context Protocol and simple workflow patterns
A browser extension that helps users publish content to multiple social media platforms with one click.
Playwright Model Context Protocol Server - Tool to automate Browsers and APIs in Claude Desktop, Cline, Cursor IDE and More 🔌
A Playwright-based Node.js tool that bypasses search engine anti-scraping mechanisms to execute Google searches. Local alternative to SERP APIs with MCP server integration.
RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry
The TypeScript AI agent framework. ⚡ Assistants, RAG, observability. Supports any LLM: GPT-4, Claude, Gemini, Llama.
Give Cursor Agent an AI Team and Advanced Skills
Open-source industrial-grade ASR models supporting Mandarin, Chinese dialects and English, achieving a new SOTA on public Mandarin ASR benchmarks, while also offering outstanding singing lyrics rec…
Model Context Protocol Servers
Official implementation of "Sonic: Shifting Focus to Global Audio Perception in Portrait Animation"
Fully local web research and report writing assistant
百聆 是一个类似GPT-4o的语音对话机器人,通过ASR+LLM+TTS实现,集成DeepSeek R1等优秀大模型,时延低至800ms,Mac等低配置也可运行,支持打断
A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.
🤖 Chat with your SQL database 📊. Accurate Text-to-SQL Generation via LLMs using RAG 🔄.
🤖 Open-source GenBI AI Agent that empowers data-driven teams to chat with their data to generate Text-to-SQL, charts, spreadsheets, reports, dashboards and BI. 📈📊📋🧑💻
Finetune Llama 3.3, DeepSeek-R1, Gemma 3 & Reasoning LLMs 2x faster with 70% less memory! 🦥
Model Context Protocol tool support for LangChain
Rust bindings to https://github.com/k2-fsa/sherpa-onnx
Streaming ASR and TTS based on FastAPI+ sherpa-onnx
Speech-to-text, text-to-speech, speaker diarization, speech enhancement, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, HarmonyOS…
Multilingual Voice Understanding Model
Whisper realtime streaming for long speech-to-text transcription and translation
End-to-end stack for WebRTC. SFU media server and SDKs.
🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time
🪞 Instant AI Face Swap, Hairstyles & Outfits — One click to a brand new you! 一键 AI 换脸、发型、穿搭,发现更美的你