Highlights
- Pro
Lists (1)
Sort Name ascending (A-Z)
Starred repositories
Search SVG Icons. Easily include popular icons in your React projects and provide an easy tool to convert SVG into React components. @icongo
截屏 离线OCR 搜索翻译 以图搜图 贴图 录屏 万向滚动截屏 屏幕翻译 Screenshot Offline OCR Search Translate Search for picture Paste the picture on the screen Screen recorder Omnidirectional scrolling screenshot Screen translator
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
High performance self-hosted photo and video management solution.
A Gradio app that transcribes YouTube videos using audio extraction and OpenAI’s Whisper model.
独立开发/出海开发相关技术栈及工具收录 / Find the best tools for indie hackers here
Generates an audiobook with chapters and ebook metadata using Calibre and Xtts from Coqui tts, and with optional voice cloning, and supports multiple languages
A Download Manager that speeds up your downloads
A serverless web app to organize and stream media from anywhere.
A mobile-friendly WebUI to run ComfyUI workflows.
OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340
🔥小巧、美观的桌面快速启动工具 Small, beautiful desktop quickstart management tool with integrated Everything search
A TV show and movie player application.
g1: Using Llama-3.1 70b on Groq to create o1-like reasoning chains
A python tool that uses GPT-4, FFmpeg, and OpenCV to automatically analyze videos, extract the most interesting sections, and crop them for an improved viewing experience.
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
An AI focused photo manipulation tool based on Gradio
Enhancing Face Realism / Epic Realism [ LoRA ]
ChatGPT Jailbreaks, GPT Assistants Prompt Leaks, GPTs Prompt Injection, LLM Prompt Security, Super Prompts, Prompt Hack, Prompt Security, Ai Prompt Engineering, Adversarial Machine Learning.
Tiny status page generated by a Python script
Mini-Cover:简洁的在线生成封面网站,专为博客、短视频、社交媒体等生成个性化封面
A machine learning-based video super resolution and frame interpolation framework. Est. Hack the Valley II, 2018.
open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.
Chat first code editor. To download the packaged app:
AI一键批量生成各类短视频,自动批量混剪短视频,自动把视频发布到抖音,快手,小红书,视频号上,赚钱从来没有这么容易过! 支持本地语音模型chatTTS,fasterwhisper,GPTSoVITS,支持云语音:Azure,阿里云,腾讯云。支持Stable diffusion,comfyUI直接AI生图。Generate short videos with one click using A…