zkcys001 (status: 🎯 Focusing)

130 results for source starred repositories

R1-onevision, a visual language model capable of deep CoT reasoning.

273 5 Updated Feb 28, 2025

Unofficial implementation of "SODA: Bottleneck Diffusion Models for Representation Learning"

Jupyter Notebook 82 4 Updated Mar 21, 2024

Janus-Series: Unified Multimodal Understanding and Generation Models

Python 16,465 2,164 Updated Feb 1, 2025

PyTorch code for hierarchical k-means -- a data curation method for self-supervised learning

Jupyter Notebook 146 11 Updated Jun 21, 2024

Official implementation of "DepthLab: From Partial to Complete"

Python 452 24 Updated Feb 14, 2025

[CVPR'25] Official Implementations for Paper - AniDoc: Animation Creation Made Easier

Python 481 32 Updated Feb 27, 2025

[CVPR'25] Official implementation for paper - LeviTor: 3D Trajectory Oriented Image-to-Video Synthesis

Python 124 5 Updated Dec 20, 2024

[ICLR 2025] Animate-X: Universal Character Image Animation with Enhanced Motion Representation

Python 239 25 Updated Feb 11, 2025

[CVPR 2025] Assessing and Learning Alignment of Unimodal Vision and Language Models

Jupyter Notebook 23 1 Updated Feb 27, 2025
Python 26 Updated Dec 12, 2024

Official PyTorch implementation of Lumos: Learning Visual Generative Priors without Text

Python 30 Updated Jan 9, 2025
Python 34 Updated Feb 27, 2025
JavaScript 3 Updated Jan 14, 2025

[CVPR'25] Official Implementations for Paper - MagicQuill: An Intelligent Interactive Image Editing System

Python 2,976 295 Updated Feb 27, 2025

Official implementations for paper: Dynamic Typography: Bringing Text to Life via Video Diffusion Prior

Python 300 14 Updated Apr 25, 2024

LLM2CLIP makes SOTA pretrained CLIP model more SOTA ever.

Python 475 23 Updated Jan 17, 2025

[ICLR'25] Official PyTorch implementation of "Framer: Interactive Frame Interpolation".

Python 435 21 Updated Jan 9, 2025

[ICLR 2025] Animate-X - PyTorch Implementation

Python 301 9 Updated Jan 24, 2025

[NeurIPS 2024] Official PyTorch implementation of LoTLIP: Improving Language-Image Pre-training for Long Text Understanding

Python 40 3 Updated Jan 14, 2025

CVPR2023 | MVImgNet: A Large-scale Dataset of Multi-view Images

Python 421 8 Updated Apr 9, 2024

The official repo for the paper "VeCLIP: Improving CLIP Training via Visual-enriched Captions"

Jupyter Notebook 239 14 Updated Jan 22, 2025

Large Gaussian Reconstruction Model for Efficient 3D Reconstruction and Generation

591 35 Updated Apr 4, 2024

PyTorch implementation of RCG https://arxiv.org/abs/2312.03701

Python 903 39 Updated Sep 27, 2024

[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Python 7,507 756 Updated Aug 12, 2024

text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Python 10,841 1,029 Updated Mar 1, 2025

[ECCV 2024] Paying More Attention to Image: A Training-Free Method for Alleviating Hallucination in LVLMs

Python 100 4 Updated Nov 6, 2024

[ECCV 2024] Official PyTorch implementation of GANdance: Exploring Guided Sampling of Conditional GANs

4 Updated Jul 16, 2024

[ECCV 2024] Official Repository for DiffiT: Diffusion Vision Transformers for Image Generation

486 18 Updated Oct 31, 2024

[NeurIPS 2024] CV-VAE: A Compatible Video VAE for Latent Generative Video Models

Jupyter Notebook 266 9 Updated Dec 4, 2024
Python 225 16 Updated Apr 10, 2024