Stars
🚀🚀🚀一款漂亮易用的在线设计器,支持PSD导入、PSD解析,可用于海报设计器、广告设计器、logo设计器、AI创作图片合成器等。常用于生成二维码海报,图片海报,二维码推广海报,图片处理,名片设计,电商产品图,节假日海报等。http://gzm-design-doc.guozimi.cn/
🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
Python APIs for web automation, testing, and bypassing bot-detection.
🏡 Open source home automation that puts local control and privacy first.
stock股票.获取股票数据,计算股票指标,筹码分布,识别股票形态,综合选股,选股策略,股票验证回测,股票自动交易,支持PC及移动设备。
🛏 An HTML to Markdown converter written in JavaScript
A bidirectional Markdown to HTML to Markdown converter written in Javascript
PDF scientific paper translation with preserved formats - 基于 AI 完整保留排版的 PDF 文档全文双语翻译,支持 Google/DeepL/Ollama/OpenAI 等服务,提供 CLI/GUI/Docker/Zotero
Convert Markdown to Word (.docx). / 将 markdown 文件转换为 Word(.docx)
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Manipulate audio with a simple and easy high level interface
开源微信爬虫:爬取公众号所有 文章、阅读量、点赞量和评论内容。易部署。持续维护!!!
WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide a seamless conversations with an AI.
A nearly-live implementation of OpenAI's Whisper.
Real time transcription with OpenAI Whisper.
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.
Cross-platform automation framework for all kinds of apps, built on top of the W3C WebDriver protocol
Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.
Real time speech to text transcription app.
Transcribe and translate your audio files - for free
Whisper realtime streaming for long speech-to-text transcription and translation
A modern Swift SDK for OpenAI's Realtime API
Node.js + JavaScript reference client for the Realtime API (beta)
React app for inspecting, building and debugging with the Realtime API
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.