Stars
Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)
The official gpt4free repository | various collection of powerful language models | o3 mini and deepseek r1
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
An interactive TLS-capable intercepting HTTP proxy for penetration testers and software developers.
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
基于大模型搭建的聊天机器人,同时支持 微信公众号、企业微信应用、飞书、钉钉 等接入,可选择GPT3.5/GPT-4o/GPT-o1/ DeepSeek/Claude/文心一言/讯飞星火/通义千问/ Gemini/GLM-4/Claude/Kimi/LinkAI,能处理文本、语音和图片,访问操作系统和互联网,支持基于自有知识库进行定制企业智能客服。
A generative speech model for daily dialogue.
OCR software, free and offline. 开源、免费的离线OCR软件。支持截屏/批量导入图片,PDF文档识别,排除水印/页眉页脚,扫描/生成二维码。内置多国语言库。
Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.
利用AI大模型,一键生成高清短视频 Generate short videos with one click using AI LLM.
小红书笔记 | 评论爬虫、抖音视频 | 评论爬虫、快手视频 | 评论爬虫、B 站视频 | 评论爬虫、微博帖子 | 评论爬虫、百度贴吧帖子 | 百度贴吧评论回复爬虫 | 知乎问答文章|评论爬虫
💬 Ready-to-use & flexible RAG Chatbot, supporting mainstream large language models (LLMs) such as DeepSeek-R1, Llama 3.3, OpenAI, and more.
Awesome pre-trained models toolkit based on PaddlePaddle. (400+ models including Image, Text, Audio, Video and Cross-Modal with Easy Inference & Serving)
Question and Answer based on Anything.
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translatio…
An open source implementation of CLIP.
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
Android in docker solution with noVNC supported and video recording
UI Automation Framework for Games and Apps
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
A UI-Focused Agent for Windows OS Interaction.
🤖 史上最强云手机远程桌面逆向抓包HOOK自动化取证能力集一体的安卓 RPA 框架,下一代移动数据自动化机器人。
跨平台 Python 异步聊天机器人框架 / Asynchronous multi-platform chatbot framework written in Python
a state-of-the-art-level open visual language model | 多模态预训练模型
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
A large model-based chatbot builder that can quickly integrate AI models (including ChatGPT, Claude, Gemini) into various software applications (such as Telegram, Gmail, Slack, and websites).
Universal and Transferable Attacks on Aligned Language Models