- Seoul National University
- Seoul, Korea
- jjihwan.github.io
- in/jjihwan
Stars
Official PyTorch implementation of "Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think" (ICLR 2025)
Official PyTorch Implementation of "History-Guided Video Diffusion"
Fully open reproduction of DeepSeek-R1
A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.
Unofficial PyTorch implementation of Titans, a state-of-the-art memory mechanism for transformers
CALVIN - A benchmark for Language-Conditioned Policy Learning for Long-Horizon Robot Manipulation Tasks
[RSS 2024] Code for "Multimodal Diffusion Transformer: Learning Versatile Behavior from Multimodal Goals" for CALVIN experiments with pre-trained weights
Implementation of "Conditional Score Guidance for Text-Driven Image-to-Image Translation" (NeurIPS 2023).
Memory-optimized training scripts for video models based on Diffusers
[NeurIPS 2024] CV-VAE: A Compatible Video VAE for Latent Generative Video Models
SEED-Voken: A Series of Powerful Visual Tokenizers
The first open autoregressive foundational video AI model.
VideoVAE+: Large Motion Video Autoencoding with Cross-modal Video VAE
VMC: Video Motion Customization using Temporal Attention Adaption for Text-to-Video Diffusion Models (CVPR 2024)
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
Official JAX implementation of xLSTM including fast and efficient training and inference code. 7B model available at https://huggingface.co/NX-AI/xLSTM-7b.
Official inference repo for FLUX.1 models
OmniGen: Unified Image Generation (https://arxiv.org/pdf/2409.11340)
Tips for Writing a Research Paper using LaTeX
Adaptive Caching for Faster Video Generation with Diffusion Transformers
Official code for "AutoVFX: Physically Realistic Video Editing from Natural Language Instructions"
Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.
[NeurIPS 2024 D&B Track] Official Repo for "LVD-2M: A Long-take Video Dataset with Temporally Dense Captions"
Official repository for "iVideoGPT: Interactive VideoGPTs are Scalable World Models" (NeurIPS 2024), https://arxiv.org/abs/2405.15223