Skip to content
View Martinser's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report Martinser

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Official PyTorch implementation of "Forgetting Transformer: Softmax Attention with a Forget Gate"

Python 28 Updated Mar 2, 2025

The official implementation of the paper "MMFuser: Multimodal Multi-Layer Feature Fuser for Fine-Grained Vision-Language Understanding". MMFuser addresses the limitations of current MLLMs in captur…

Python 49 4 Updated Nov 5, 2024
Python 8 Updated Feb 6, 2025

[arXiv: 2502.05178] QLIP: Text-Aligned Visual Tokenization Unifies Auto-Regressive Multimodal Understanding and Generation

Jupyter Notebook 55 1 Updated Mar 1, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 40,063 5,998 Updated Mar 3, 2025

[CVPR 2025] Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models

Python 380 7 Updated Feb 27, 2025

A generative world for general-purpose robotics & embodied AI learning.

Python 24,170 2,088 Updated Mar 3, 2025

Official Implementation of Rectified Flow (ICLR2023 Spotlight)

Python 1,116 64 Updated Jul 20, 2024

Official PyTorch and Diffusers Implementation of "LinFusion: 1 GPU, 1 Minute, 16K Image"

Python 291 18 Updated Dec 23, 2024

[CVPR 2025] CoDe: Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient

Python 82 4 Updated Mar 2, 2025

[NeurIPS 2024] Classification Done Right for Vision-Language Pre-Training

Python 200 7 Updated Jan 13, 2025

Code for Fast Training of Diffusion Models with Masked Transformers

Python 392 15 Updated May 15, 2024

Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis

Python 988 40 Updated Feb 23, 2025

Efficient vision foundation models for high-resolution generation and perception.

Python 2,678 214 Updated Jan 24, 2025

SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer

Python 3,486 212 Updated Feb 12, 2025

Codebase for the paper-Elucidating the design space of language models for image generation

Python 45 1 Updated Nov 17, 2024

Scaling Diffusion Transformers with Mixture of Experts

Python 284 13 Updated Sep 9, 2024

[ICLR 2025] BiGR: Harnessing Binary Latent Codes for Image Generation and Improved Visual Representation Capabilities

Python 138 1 Updated Jan 26, 2025
Jupyter Notebook 189 6 Updated Feb 11, 2025

This repo contains the code for 1D tokenizer and generator

Jupyter Notebook 698 37 Updated Feb 24, 2025

PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838

Python 1,309 71 Updated Sep 27, 2024

[CVPR2025] Number it: Temporal Grounding Videos like Flipping Manga

Python 56 1 Updated Nov 29, 2024

Codes accompanying the paper "Toward Guidance-Free AR Visual Generation via Condition Contrastive Alignment"

Python 26 Updated Feb 11, 2025

Official Pytorch Implementation of Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think (ICLR 2025)

Python 849 40 Updated Jan 28, 2025

Official repository for our work on micro-budget training of large-scale diffusion models.

Python 1,260 49 Updated Jan 12, 2025

Fast Diffusion Models with Transformers

Python 796 108 Updated Oct 25, 2024

Let us control diffusion models!

Python 31,598 2,828 Updated Feb 25, 2024

Official PyTorch implementation of ECCV 2024 Paper: ControlNet++: Improving Conditional Controls with Efficient Consistency Feedback.

Python 480 21 Updated Jan 12, 2025

[ICLR 2025] Diffusion Feedback Helps CLIP See Better

Python 263 14 Updated Jan 23, 2025

[CVPR 2025] Q-DiT: Accurate Post-Training Quantization for Diffusion Transformers

Python 38 4 Updated Sep 3, 2024
Next