Stars
[NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment
Official implementation of the paper "AnyText: Multilingual Visual Text Generation And Editing"
This project aims to reproduce Sora (OpenAI's T2V model); we welcome the open-source community to contribute.
We introduce a novel approach to parameter generation, named neural network parameter diffusion (p-diff), which employs a standard latent diffusion model to synthesize a new set of network parameters.
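To make the idea concrete, here is a minimal, hypothetical sketch: treat flattened checkpoints of one fixed architecture as data points and train an epsilon-prediction diffusion model over them. The real p-diff first compresses parameters with an autoencoder and diffuses in that latent space; the toy architecture, noise schedule, and MLP denoiser below are illustrative assumptions only.

```python
import torch
import torch.nn as nn

def flatten_params(model: nn.Module) -> torch.Tensor:
    """Concatenate a model's parameters into one vector (one 'data point')."""
    return torch.cat([p.detach().flatten() for p in model.parameters()])

# Stand-in for a collection of trained checkpoints of the same tiny net.
def make_net() -> nn.Module:
    return nn.Sequential(nn.Linear(8, 16), nn.Tanh(), nn.Linear(16, 2))

checkpoints = torch.stack([flatten_params(make_net()) for _ in range(64)])
param_dim = checkpoints.shape[1]

# Simple MLP that predicts the noise added to a parameter vector.
denoiser = nn.Sequential(
    nn.Linear(param_dim + 1, 512), nn.SiLU(),
    nn.Linear(512, 512), nn.SiLU(),
    nn.Linear(512, param_dim),
)
opt = torch.optim.AdamW(denoiser.parameters(), lr=1e-4)

T = 1000
betas = torch.linspace(1e-4, 0.02, T)
alphas_bar = torch.cumprod(1.0 - betas, dim=0)

for step in range(200):
    x0 = checkpoints[torch.randint(0, len(checkpoints), (32,))]
    t = torch.randint(0, T, (32,))
    noise = torch.randn_like(x0)
    a_bar = alphas_bar[t].unsqueeze(-1)
    xt = a_bar.sqrt() * x0 + (1 - a_bar).sqrt() * noise  # forward process
    t_feat = (t.float() / T).unsqueeze(-1)               # crude timestep embed
    loss = nn.functional.mse_loss(
        denoiser(torch.cat([xt, t_feat], dim=-1)), noise  # epsilon-prediction
    )
    opt.zero_grad(); loss.backward(); opt.step()
```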
Text- and image-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
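For reference, CogVideoX can also be run through the Hugging Face diffusers integration rather than the official repo; the hub id, frame count, and sampler settings below are assumptions based on the public 2B release.

```python
import torch
from diffusers import CogVideoXPipeline
from diffusers.utils import export_to_video

pipe = CogVideoXPipeline.from_pretrained(
    "THUDM/CogVideoX-2b", torch_dtype=torch.float16
).to("cuda")

frames = pipe(
    prompt="a panda playing guitar in a bamboo forest",
    num_inference_steps=50,
    num_frames=49,       # assumed default clip length for this model
    guidance_scale=6.0,
).frames[0]
export_to_video(frames, "panda.mp4", fps=8)
```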
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer
[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ult…
Code for "Pyramidal Flow Matching for Efficient Video Generative Modeling"
Code for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion"
HunyuanVideo: A Systematic Framework For Large Video Generation Model
Official inference repo for FLUX.1 models
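A hedged, minimal way to try FLUX.1 [dev] is via the diffusers integration rather than this official repo; the model id and sampler settings below are assumptions.

```python
import torch
from diffusers import FluxPipeline

pipe = FluxPipeline.from_pretrained(
    "black-forest-labs/FLUX.1-dev", torch_dtype=torch.bfloat16
)
pipe.enable_model_cpu_offload()  # reduces VRAM use at some speed cost

image = pipe(
    "a photo of a red fox in the snow",
    num_inference_steps=50,
    guidance_scale=3.5,
).images[0]
image.save("fox.png")
```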
PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation
SUPIR aims to develop Practical Algorithms for Photo-Realistic Image Restoration In the Wild. Our new online demo is also released at suppixel.ai.
Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference
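A short sketch of what few-step LCM inference looks like through diffusers; the checkpoint id (a public LCM distillation of Dreamshaper) and step count are assumptions.

```python
import torch
from diffusers import DiffusionPipeline

pipe = DiffusionPipeline.from_pretrained(
    "SimianLuo/LCM_Dreamshaper_v7", torch_dtype=torch.float16
).to("cuda")

# LCMs trade a small quality loss for 4-8 sampling steps instead of 25-50.
image = pipe(
    "a watercolor painting of a lighthouse at dawn",
    num_inference_steps=4,
    guidance_scale=8.0,
).images[0]
image.save("lighthouse.png")
```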
Open-Sora: Democratizing Efficient Video Production for All
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
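The core of the paper's DiT block is adaLN-Zero conditioning: the timestep/class embedding regresses per-block shift, scale, and gate parameters, with gates initialized to zero so each block starts as the identity. A simplified, self-contained sketch (not the official code):

```python
import torch
import torch.nn as nn

class DiTBlock(nn.Module):
    def __init__(self, dim: int, heads: int):
        super().__init__()
        self.norm1 = nn.LayerNorm(dim, elementwise_affine=False)
        self.attn = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.norm2 = nn.LayerNorm(dim, elementwise_affine=False)
        self.mlp = nn.Sequential(
            nn.Linear(dim, 4 * dim), nn.GELU(), nn.Linear(4 * dim, dim)
        )
        # adaLN-Zero: conditioning regresses shift/scale/gate per sub-layer;
        # zero init makes every gate 0, so the block starts as the identity.
        self.ada = nn.Linear(dim, 6 * dim)
        nn.init.zeros_(self.ada.weight); nn.init.zeros_(self.ada.bias)

    def forward(self, x: torch.Tensor, c: torch.Tensor) -> torch.Tensor:
        # c: per-sample conditioning (timestep + class embedding), shape (B, dim)
        s1, b1, g1, s2, b2, g2 = self.ada(c).chunk(6, dim=-1)
        h = self.norm1(x) * (1 + s1.unsqueeze(1)) + b1.unsqueeze(1)
        x = x + g1.unsqueeze(1) * self.attn(h, h, h, need_weights=False)[0]
        h = self.norm2(x) * (1 + s2.unsqueeze(1)) + b2.unsqueeze(1)
        return x + g2.unsqueeze(1) * self.mlp(h)

tokens = torch.randn(2, 64, 384)  # (batch, patch tokens, hidden dim)
cond = torch.randn(2, 384)        # timestep/class conditioning vector
print(DiTBlock(384, 6)(tokens, cond).shape)  # torch.Size([2, 64, 384])
```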
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
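Image inference follows a predictor pattern roughly like the sketch below; the checkpoint filename, config name, and click coordinates are assumptions for a local setup.

```python
import numpy as np
import torch
from sam2.build_sam import build_sam2
from sam2.sam2_image_predictor import SAM2ImagePredictor

checkpoint = "./checkpoints/sam2_hiera_large.pt"  # downloaded separately
model_cfg = "sam2_hiera_l.yaml"                   # assumed config name
predictor = SAM2ImagePredictor(build_sam2(model_cfg, checkpoint))

image = np.zeros((1024, 1024, 3), dtype=np.uint8)  # stand-in HxWx3 RGB image

with torch.inference_mode(), torch.autocast("cuda", dtype=torch.bfloat16):
    predictor.set_image(image)
    # One positive click at pixel (500, 375); label 1 marks foreground.
    masks, scores, _ = predictor.predict(
        point_coords=np.array([[500, 375]]),
        point_labels=np.array([1]),
    )
```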
Utilities intended for use with Llama models.
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment, and Generate Anything
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
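Open-set detection is driven by a text caption of period-separated phrases; a sketch using the repo's inference utilities, with config/weight paths and thresholds as assumptions:

```python
import cv2
from groundingdino.util.inference import load_model, load_image, predict, annotate

model = load_model(
    "groundingdino/config/GroundingDINO_SwinT_OGC.py",  # assumed local paths
    "weights/groundingdino_swint_ogc.pth",
)
image_source, image = load_image("demo.jpg")

# The caption lists the open-set categories to ground, separated by periods.
boxes, logits, phrases = predict(
    model=model,
    image=image,
    caption="chair . person . dog .",
    box_threshold=0.35,
    text_threshold=0.25,
)
annotated = annotate(
    image_source=image_source, boxes=boxes, logits=logits, phrases=phrases
)
cv2.imwrite("annotated.jpg", annotated)
```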
Collection of AWESOME vision-language models for vision tasks
A collection of resources and papers on Diffusion Models
ICLR 2024 Spotlight: curation/training code, metadata, distribution, and pre-trained models for MetaCLIP; CVPR 2024: MoDE: CLIP Data Experts via Clustering
EVA Series: Visual Representation Fantasies from BAAI
MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.
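A hedged example of chatting with it through transformers; the hub id and trust_remote_code requirement follow the public model card as I understand it.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "openbmb/MiniCPM3-4B"  # assumed hub id
tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype=torch.bfloat16, device_map="cuda",
    trust_remote_code=True,
)

messages = [
    {"role": "user", "content": "Summarize diffusion models in one sentence."}
]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)
outputs = model.generate(inputs, max_new_tokens=128)
# Decode only the newly generated tokens, skipping the prompt.
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))
```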
[ECCV 2024] Grounded Multimodal Large Language Model with Localized Visual Tokenization