Ge Wu Martinser

🎯

Focusing

36 followers · 281 following

Nankai University

Starred repositories

zhixuan-lin / forgetting-transformer

Official PyTorch implementation of "Forgetting Transformer: Softmax Attention with a Forget Gate"

Python 28 Updated Mar 2, 2025

yuecao0119 / MMFuser

The official implementation of the paper "MMFuser: Multimodal Multi-Layer Feature Fuser for Fine-Grained Vision-Language Understanding". MMFuser addresses the limitations of current MLLMs in captur…

Python 49 4 Updated Nov 5, 2024

xinwangChen / EDT

Python 8 Updated Feb 6, 2025

NVlabs / QLIP

[arXiv: 2502.05178] QLIP: Text-Aligned Visual Tokenization Unifies Auto-Regressive Multimodal Understanding and Generation

Jupyter Notebook 55 1 Updated Mar 1, 2025

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 40,063 5,998 Updated Mar 3, 2025

hustvl / LightningDiT

[CVPR 2025] Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models

Python 380 7 Updated Feb 27, 2025

Genesis-Embodied-AI / Genesis

A generative world for general-purpose robotics & embodied AI learning.

Python 24,170 2,088 Updated Mar 3, 2025

gnobitab / RectifiedFlow

Official Implementation of Rectified Flow (ICLR2023 Spotlight)

Python 1,116 64 Updated Jul 20, 2024

Huage001 / LinFusion

Official PyTorch and Diffusers Implementation of "LinFusion: 1 GPU, 1 Minute, 16K Image"

Python 291 18 Updated Dec 23, 2024

czg1225 / CoDe

[CVPR 2025] CoDe: Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient

Python 82 4 Updated Mar 2, 2025

x-cls / superclass

[NeurIPS 2024] Classification Done Right for Vision-Language Pre-Training

Python 200 7 Updated Jan 13, 2025

Anima-Lab / MaskDiT

Code for Fast Training of Diffusion Models with Masked Transformers

Python 392 15 Updated May 15, 2024

FoundationVision / Infinity

Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis

Python 988 40 Updated Feb 23, 2025

mit-han-lab / efficientvit

Efficient vision foundation models for high-resolution generation and perception.

Python 2,678 214 Updated Jan 24, 2025

NVlabs / Sana

SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer

Python 3,486 212 Updated Feb 12, 2025

Pepper-lll / LMforImageGeneration

Codebase for the paper "Elucidating the Design Space of Language Models for Image Generation"

Python 45 1 Updated Nov 17, 2024

feizc / DiT-MoE

Scaling Diffusion Transformers with Mixture of Experts

Python 284 13 Updated Sep 9, 2024

haoosz / BiGR

[ICLR 2025] BiGR: Harnessing Binary Latent Codes for Image Generation and Improved Visual Representation Capabilities

Python 138 1 Updated Jan 26, 2025

microsoft / Reducio-VAE

Jupyter Notebook 189 6 Updated Feb 11, 2025

bytedance / 1d-tokenizer

This repo contains the code for 1D tokenizer and generator

Jupyter Notebook 698 37 Updated Feb 24, 2025

LTH14 / mar

PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838

Python 1,309 71 Updated Sep 27, 2024

yongliang-wu / NumPro

[CVPR2025] Number it: Temporal Grounding Videos like Flipping Manga

Python 56 1 Updated Nov 29, 2024

thu-ml / CCA

Codes accompanying the paper "Toward Guidance-Free AR Visual Generation via Condition Contrastive Alignment"

Python 26 Updated Feb 11, 2025

sihyun-yu / REPA

Official PyTorch Implementation of Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think (ICLR 2025)

Python 849 40 Updated Jan 28, 2025

SonyResearch / micro_diffusion

Official repository for our work on micro-budget training of large-scale diffusion models.

Python 1,260 49 Updated Jan 12, 2025

chuanyangjin / fast-DiT

Fast Diffusion Models with Transformers

Python 796 108 Updated Oct 25, 2024

lllyasviel / ControlNet

Let us control diffusion models!

Python 31,598 2,828 Updated Feb 25, 2024

liming-ai / ControlNet_Plus_Plus

Official PyTorch implementation of ECCV 2024 Paper: ControlNet++: Improving Conditional Controls with Efficient Consistency Feedback.

Python 480 21 Updated Jan 12, 2025

baaivision / DIVA

[ICLR 2025] Diffusion Feedback Helps CLIP See Better

Python 263 14 Updated Jan 23, 2025

Juanerx / Q-DiT

[CVPR 2025] Q-DiT: Accurate Post-Training Quantization for Diffusion Transformers

Python 38 4 Updated Sep 3, 2024

Lists (28)

3D

clip

CoT

datasets

DETR

Diffusion

🔮 Future ideas

GAN

GPT

latex

Linear attention

Lora

MAE

mamba

Mixup/Cutmix

MLP

Moe

Network

NLP

Optimizers

OVSS+OVD

RNN

SAM

Semantic Segmentation

Uncertainty

Wait

work

Writing
