-
POSTECH
- South Korea
- https://mingukkang.github.io/
- @minguk_kang
Lists (1)
Sort Name ascending (A-Z)
Stars
[ECCV'24] Official PyTorch implementation of In Defense of Lazy Visual Grounding for Open-Vocabulary Semantic Segmentation
FlashTex: Fast Relightable Mesh Texturing with LightControlNet
InstantDrag: Improving Interactivity in Drag-based Image Editing
[ECCV2024] Official Implementation of "NVS-Adapter: Plug-and-Play Novel View Synthesis from a Single Image"
Author's Implementation for E-LatentLPIPS
Unofficial implementation of E-LatentLPIPS in Diffusion2GAN
Official inference repo for FLUX.1 models
Official codes of VEnhancer: Generative Space-Time Enhancement for Video Generation
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions
Custom ComfyUI nodes for Vision Language Models, Large Language Models, Image to Music, Text to Music, Consistent and Random Creative Prompt Generation
AuraSR: GAN-based Super-Resolution for real-world
Unofficial Implementation of E-LatentLPIPS(Ensembled-LatentLPIPS) of Diffusion2GAN
(NeurIPS 2024 Oral 🔥) Improved Distribution Matching Distillation for Fast Image Synthesis
Website source files for Diffusion2GAN Project.
Official PyTorch implementation of CorrespondentDream: Enhancing 3D Fidelity of Text-to-3D using Cross-View Correspondences (CVPR 2024 Poster)
[CVPR'24] Official PyTorch implementation of Contrastive Mean-Shift Learning for Generalized Category Discovery
EDM2 and Autoguidance -- Official PyTorch implementation
One-step image-to-image with Stable Diffusion turbo: sketch2image, day2night, and more
Implementation of "Analyzing and Improving the Training Dynamics of Diffusion Models"
Official repository of the "ReSTR: Convolution-Free Referring Image Segmentation Using Transformers (CVPR'22)"
[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)
Official repository for the paper "Instance-Wise Holistic Order Prediction in Natural Scenes".