Lists (7)
Sort Name ascending (A-Z)
Starred repositories
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
📦 Repomix (formerly Repopack) is a powerful tool that packs your entire repository into a single, AI-friendly file. Perfect for when you need to feed your codebase to Large Language Models (LLMs) o…
A Chrome Extensions boilerplate using React 18 and Webpack 5.
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
[arXiv 2024] Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis
跨平台视频提取工具:支持流媒体下载、视频下载、m3u8 下载及 B站视频下载,提供 Windows 和 Mac 桌面客户端。Cross-platform video extraction tool: Supports streaming download, video download, m3u8 download, and Bilibili video download, with des…
Comflowyspace is an intuitive, user-friendly, open-source AI tool for generating images and videos, democratizing access to AI technology.
Open-source, accurate and easy-to-use video speech recognition & clipping tool, LLM based AI clipping intergrated.
视频硬字幕提取,生成srt文件。无需申请第三方API,本地实现文本识别。基于深度学习的视频字幕提取框架,包含字幕区域检测、字幕内容提取。A GUI tool for extracting hard-coded subtitle (hardsub) from videos and generating srt files.
Open source short video automatic generation tool
A fluent API to FFMPEG (http://www.ffmpeg.org)
Controllable video and image Generation, SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA
High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance
Pythonic AI generation of images and videos
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion
小视频宝:AI 驱动的视频生成工具,一键生成高质量营销视频 AI-powered video generation tool for creating high-quality marketing videos with one click.
Model Context Protocol Servers
Deskflow lets you share one mouse and keyboard between multiple computers on Windows, macOS and Linux. It's like a software KVM (but without video).
Open source UI framework written in Python, running on Windows, Linux, macOS, Android and iOS
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…
VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models
Free youtube video uploader with no limits
Twitter bot powered by OpenAI's ChatGPT API. It's aliveeeee 🤖
Generate tiktok signature token using node
Custom Selenium Chromedriver | Zero-Config | Passes ALL bot mitigation systems (like Distil / Imperva/ Datadadome / CloudFlare IUAM)