Skip to content
View z-jiaming's full-sized avatar
🧐
🧐

Highlights

  • Pro

Block or report z-jiaming

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Phantom: Subject-Consistent Video Generation via Cross-Modal Alignment

CSS 407 24 Updated Mar 1, 2025

A pipeline parallel training script for diffusion models.

Python 588 58 Updated Feb 27, 2025

Enjoy the magic of Diffusion models!

Python 7,549 677 Updated Mar 2, 2025

Wan: Open and Advanced Large-Scale Video Generative Models

Python 6,110 571 Updated Feb 28, 2025

xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism

Python 1,370 114 Updated Feb 28, 2025

FlashMLA: Efficient MLA decoding kernels

C++ 10,927 737 Updated Mar 1, 2025

This is a repo to track the latest autoregressive visual generation papers.

150 Updated Mar 1, 2025

Video Generation Foundation Models: https://saiyan-world.github.io/goku/

Python 2,537 262 Updated Feb 19, 2025

Let's finetune video generation models!

Python 410 15 Updated Feb 24, 2025

A collection of vision foundation models unifying understanding and generation.

42 2 Updated Jan 2, 2025

Xiaomi Home Integration for Home Assistant

Python 18,652 921 Updated Feb 28, 2025

LoRAT_pytracking: reproduction of [ECCV2024] LoRAT

Python 38 2 Updated Dec 9, 2024

VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.

Python 2,968 240 Updated Mar 1, 2025

text window manager, shell multiplexer, integrated DevOps environment

Shell 1,292 125 Updated May 2, 2024

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 17,571 1,764 Updated Feb 26, 2025

Official PyTorch implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States

Python 1,132 72 Updated Jul 14, 2024

A series of basic algorithms that are useful for video understanding, including Single Object Tracking (SOT), Video Object Segmentation (VOS) and so on.

Python 842 172 Updated Aug 3, 2023

[NeurIPS2024 Spotlight] The official implementation of GrootVL: Tree Topology is All You Need in State Space Model

Python 90 3 Updated Jun 13, 2024

This repository is the official implementation of our Autoregressive Pretraining with Mamba in Vision

Python 69 1 Updated Jun 17, 2024

Official repository of MLLA (NeurIPS 2024)

Python 276 16 Updated Nov 25, 2024

提取微信聊天记录,将其导出成HTML、Word、Excel文档永久保存,对聊天记录进行分析生成年度聊天报告,用聊天数据训练专属于个人的AI聊天助手

Python 37,701 3,881 Updated Jan 2, 2025

[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ult…

Jupyter Notebook 6,768 443 Updated Jan 12, 2025

Automatically update arXiv papers about SOT & VLT, Multi-modal Learning, LLM and Video Understanding using Github Actions.

Python 22 3 Updated Mar 2, 2025
Jupyter Notebook 175 9 Updated Feb 8, 2025

Official PyTorch Implementation of "The Hidden Attention of Mamba Models"

Python 214 13 Updated May 27, 2024

Simba

Python 201 19 Updated Mar 24, 2024

[ECCV2024] VideoMamba: State Space Model for Efficient Video Understanding

Python 907 68 Updated Jul 6, 2024

[ICCV 2023] StableVideo: Text-driven Consistency-aware Diffusion Video Editing

Python 1,418 89 Updated Sep 7, 2023

Diffusion Model-Based Image Editing: A Survey (arXiv)

566 36 Updated Feb 27, 2025
Next