Highlights
- Pro
Stars
Extends the support of Merlin firmware to more ASUS routers
"LightRAG: Simple and Fast Retrieval-Augmented Generation"
Mustango: Toward Controllable Text-to-Music Generation
An easy-to-use library for skin tone classification
Image Forgery Detection and Localization (and related) Papers List
Official repository of "Deep Image Composition Meets Image Forgery"
The first dataset of composite images with rationality score indicating whether the object placement in a composite image is reasonable.
✏️ Web-based image segmentation tool for object detection, localization, and keypoints with RITM support
Skin Reflectance Estimate Based on Dichromatic Separation (SREDS)
MiVOLO age & gender transformer neural network
Task-Optimized Adapters for an End-to-End Dialogue System Paper Code
🦖Pytorch implementation of popular Attention Mechanisms, Vision Transformers, MLP-Like models and CNNs.🔥🔥🔥
Multi-Task Pre-Training for Plug-and-Play Task-Oriented Dialogue System (ACL 2022)
Reference code for the paper HistoGAN: Controlling Colors of GAN-Generated and Real Images via Color Histograms (CVPR 2021).
ICCV 2023 "Neural Video Depth Stabilizer" (NVDS) & TPAMI 2024 "NVDS+: Towards Efficient and Versatile Neural Stabilizer for Video Depth Estimation" (NVDS+)
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
The Compute Library is a set of computer vision and machine learning functions optimised for both Arm CPUs and GPUs using SIMD technologies.
Text2Room generates textured 3D meshes from a given text prompt using 2D text-to-image models (ICCV2023).
Create web-based user interfaces with Python. The nice way.
guillermogotre / CUSP
Forked from NVlabs/stylegan2-ada-pytorchOfficial code for "Custom Structure Preservation in Face Aging"
[ICCV 2023] Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation
The official repo for [NeurIPS'22] "ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation" and [TPAMI'23] "ViTPose++: Vision Transformer for Generic Body Pose Estimation"
DynaGAN: Dynamic Few-shot Adaptation of GANs to Multiple Domains (SIGGRAPH Asia 2022)
Implementation of Denoising Diffusion Probabilistic Model in Pytorch
Materials for the Hugging Face Diffusion Models Course