Lists (1)
Sort Name ascending (A-Z)
Stars
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
A list of useful payloads and bypass for Web Application Security and Pentest/CTF
Interact with your documents using the power of GPT, 100% privately, no data leaks
Platform to experiment with the AI Software Engineer. Terminal based. NOTE: Very different from https://gptengineer.app
real time face swap and one-click video deepfake with only a single image
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Opiniated RAG for integrating GenAI in your apps 🧠 Focus on your product rather than the RAG. Easy integration in existing products with customisation! Any LLM: GPT4, Groq, Llama. Any Vectorstore: …
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Instant voice cloning by MIT and MyShell. Audio foundation model.
Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.
🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper
A community-supported supercharged version of paperless: scan, index and archive all your physical documents
Open-Sora: Democratizing Efficient Video Production for All
The best and simplest free open source web page change detection, website watcher, restock monitor and notification service. Restock Monitor, change detection. Designed for simplicity - Simply moni…
Industry leading face manipulation platform
An open-source RAG-based tool for chatting with your documents.
Devika is an Agentic AI Software Engineer that can understand high-level human instructions, break them down into steps, research relevant information, and write code to achieve the given objective…
AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data…
OCR, layout analysis, reading order, table recognition in 90+ languages
🕵️♂️ Collect a dossier on a person by username from thousands of sites
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Command-line program to download image galleries and collections from several image hosting sites
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs