Stars
ComfyUI nodes for Janus-Pro, a unified multimodal understanding and generation framework.
AI powered speech denoising and enhancement
A machine learning-based video super resolution and frame interpolation framework. Est. Hack the Valley II, 2018.
[SIGGRAPH Asia 2024, Journal Track] ToonCrafter: Generative Cartoon Interpolation
Devika is an Agentic AI Software Engineer that can understand high-level human instructions, break them down into steps, research relevant information, and write code to achieve the given objective…
Ongoing maintenance of Steven Han's original Easy Bridge.
William Whitaker's WORDS, a Latin dictionary
Python program to delay/advance subtitles in .srt files
The most powerful and modular diffusion model GUI, API and backend with a graph/nodes interface.
Mirror of https://gitlab.mister-muffin.de/josch/img2pdf for Travis and AppVeyor CI
pygccxml is a specialized XML reader that reads the output from CastXML. It provides a simple framework to navigate C++ declarations, using Python classes.
A PHP component to convert HTML into a plain text format
A Gradio web UI for Large Language Models with support for multiple inference backends.
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
🦜🔗 Build context-aware reasoning applications
RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RN…
ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source.
Ready-to-use OCR with 80+ supported languages and all popular writing scripts, including Latin, Chinese, Arabic, Devanagari, Cyrillic, etc.
The Neolatin Wordlist (Neulateinische Wortliste; NLW) is a lexical resource that collects entries from Latin texts written in Europe between 1300 and 1600. The wordlist consists of circa 22000 lemm…
Open source neural network chess engine with GPU acceleration and broad hardware support.
Go engine with no human-provided knowledge, modeled after the AlphaGo Zero paper.
Unified-Modal Speech-Text Pre-Training for Spoken Language Processing
WebUI extension for ControlNet
Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) by way of Textual Inversion (https://arxiv.org/abs/2208.01618) for Stable Diffusion (https://arxiv.org/abs/2112.10752). Tweaks focuse…