Stars
A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.
✨✨Latest Advances on Multimodal Large Language Models
HunyuanVideo: A Systematic Framework For Large Video Generation Model
A Large-Scale High-Quality Dataset for Enhancing Human-Centric Video Generation
📹 A more flexible CogVideoX that can generate videos at any resolution and creates videos from images.
InstantIR: Blind Image Restoration with Instant Generative Reference 🔥
This repository contains scripts to build Youtube Gesture Dataset.
[CVPR'23, Highlight] ECON: Explicit Clothed humans Optimized via Normal integration
Large dataset of hand-object contact, hand- and object-pose, and 2.9 M RGB-D grasp images.
Ongoing research training transformer models at scale
Official PyTorch implementation of "InterHand2.6M: A Dataset and Baseline for 3D Interacting Hand Pose Estimation from a Single RGB Image", ECCV 2020
📖 A curated list of resources dedicated to talking face.
OpenPose: Real-time multi-person keypoint detection library for body, face, hands, and foot estimation
High-resolution models for human tasks.
A Collection of Variational Autoencoders (VAE) in PyTorch.
Official inference repo for FLUX.1 models
GenEval: An object-focused framework for evaluating text-to-image alignment
A programming framework for agentic AI 🤖 PyPi: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/autogen-officehour
Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.
Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
A Python port of the MATLAB reference implementation
Blur Detection with OpenCV in Python
Some commonly-used image quality assessment algorithms.
Perceptual video quality assessment based on multi-method fusion.