Lists (2)
Sort Name ascending (A-Z)
Stars
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
🔥🔥🔥AI-driven database tool and SQL client, The hottest GUI client, supporting MySQL, Oracle, PostgreSQL, DB2, SQL Server, DB2, SQLite, H2, ClickHouse, and more.
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.
High-performance GPGPU inference of OpenAI's Whisper automatic speech recognition (ASR) model
Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your permission every step of the way.
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
利用AI大模型,一键解说并剪辑视频; Using AI models to automatically provide commentary and edit videos with a single click.
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
Awesome-LLM: a curated list of Large Language Model
Stable Diffusion web UI
aider is AI pair programming in your terminal
Rust API to get all user desktop data (local, cross platform, 24/7, screen, voice, keyboard, mouse, camera recording). sandboxed js plugin system. keyboard and mouse control
Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚
🚀🤖 Crawl4AI: Crawl Smarter, Faster, Freely. For AI.
ML-powered speech recognition directly in your browser
State-of-the-art Machine Learning for the web. Run 🤗 Transformers directly in your browser, with no need for a server!
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
llama and other large language models on iOS and MacOS offline using GGML library.
Open source free capture HTTP(S) traffic software ProxyPin, supporting full platform systems
Ikaros-521 / AI-Vtuber
Forked from sandboxdream/AI-VtuberAI Vtuber是一个由 【ChatterBot/ChatGPT/claude/langchain/chatglm/text-gen-webui/闻达/千问/kimi/ollama】 驱动的虚拟主播【Live2D/UE/xuniren】,可以在 【Bilibili/抖音/快手/微信视频号/拼多多/斗鱼/YouTube/twitch/TikTok】 直播中与观众实时互动 或 直接在本地进行聊…
Robust Speech Recognition via Large-Scale Weak Supervision
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.