Starred repositories
Reiverr is a clean combined interface for Jellyfin, TMDB, Radarr and Sonarr, as well as a replacement to Overseerr
Generative models for conditional audio generation
Mora: More like Sora for Generalist Video Generation
Multimodal Emotion eXpression Capture Amsterdam. Pipeline for capturing emotion expressions from multiple modalities (video, audio, text) in the wild.
ClusterPlex is an extended version of Plex, which supports distributed Workers across a cluster to handle transcoding requests.
A Twitch-inspired webpage that allows synced video and chat playback.
A content management system for video catalogs with chat replay capability.
A script that will parse, modify and update a Google Photos archive for upload to an iCloud Photo Library
TTS Generation Web UI (Bark, MusicGen + AudioGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, MAGNet, StyleTTS2, MMS, Stable Audio, Mars5, F5-TTS, ParlerTTS)
Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activelo…
Gecko - A Tool for Effective Annotation of Human Conversations
MinLabel is a voice label tool based on Tkinter in Python3.
Build smaller, faster, and more secure desktop and mobile applications with a web frontend.
Your self hosted YouTube media server
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
🦜🔗 Build context-aware reasoning applications
A command-line interface to generate textual and conversational datasets with LLMs.
Daily is a tool to transcode scene-linear openexr images into display-referred quicktime movies.
LlamaIndex is a data framework for your LLM applications
[ECCV 2022] Relighting4D: Neural Relightable Human from Videos
Insert DFL metadata in square JPG image files
Grayscale SAEHD model and mode for training deepfakes. Notes, tests, experience, tools, study and explanations of the source code.
GeneFace: Generalized and High-Fidelity 3D Talking Face Synthesis; ICLR 2023; Official code
Automatically empty the trash in all of your Plex libraries