[NeurIPS-24] This is the official implementation of the paper "DeepStack: Deeply Stacking Visual Tokens is Surprisingly Simple and Effective for LMMs".

Python 33 2 Updated Jun 17, 2024

microsoft / DeepSpeedExamples

Example models using DeepSpeed

Python 6,149 1,050 Updated Dec 14, 2024

Vchitect / Latte

Latte: Latent Diffusion Transformer for Video Generation.

Python 1,731 180 Updated Sep 28, 2024

huggingface / diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.

Python 26,627 5,488 Updated Dec 14, 2024

NUS-HPC-AI-Lab / VideoSys

VideoSys: An easy and efficient system for video generation

Python 1,811 126 Updated Dec 11, 2024

THUDM / Inf-DiT

Official implementation of Inf-DiT: Upsampling Any-Resolution Image with Memory-Efficient Diffusion Transformer

Python 383 19 Updated Jul 5, 2024

TencentARC / SEED-Voken

SEED-Voken: A Series of Powerful Visual Tokenizers

Python 771 30 Updated Dec 4, 2024

PKU-YuanGroup / Open-Sora-Plan

This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.

Python 11,717 1,034 Updated Dec 13, 2024

FoundationVision / OmniTokenizer

[NeurIPS 2024]OmniTokenizer: one model and one weight for image-video joint tokenization.

Python 272 7 Updated Jul 9, 2024

Alpha-VLLM / Lumina-T2X

Lumina-T2X is a unified framework for Text to Any Modality Generation

Python 2,108 88 Updated Aug 6, 2024

Doubiiu / DynamiCrafter

[ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors

Python 2,669 213 Updated Sep 8, 2024

songweige / TATS

Official PyTorch implementation of TATS: A Long Video Generation Framework with Time-Agnostic VQGAN and Time-Sensitive Transformer (ECCV 2022)

Python 270 17 Updated May 1, 2024

HVision-NKU / StoryDiffusion

Accepted as [NeurIPS 2024] Spotlight Presentation Paper

Jupyter Notebook 6,030 602 Updated Sep 26, 2024

FoundationVision / VAR

[NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-sim…

Python 6,169 418 Updated Dec 6, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Rui Tian ruitian12

Achievements

Achievements

Block or report ruitian12

Lists (1)

Unified MLLM

Stars

microsoft / Reducio-VAE

MengLcool / SliMM

YiwengXie / Chat-Video

open-compass / VLMEvalKit

mit-han-lab / vila-u

deepseek-ai / Janus

facebookresearch / MovieGenBench

richzhang / PerceptualSimilarity

NVIDIA / Megatron-LM

THUDM / CogVideo

Vchitect / VBench

thunlp / InfLLM

LLaVA-VL / LLaVA-NeXT

IDEA-Research / Grounded-SAM-2

apple / ml-4m

black-forest-labs / flux

MengLcool / DeepStack-VL