Starred repositories

llama3 implementation one matrix multiplication at a time

Jupyter Notebook 13,963 1,140 Updated May 23, 2024

WiLoR: End-to-end 3D hand localization and reconstruction in-the-wild

Python 210 16 Updated Oct 17, 2024

A simple and flexible PyTorch implementation of StableDiffusion-3 based on diffusers for DIY and finetuning.

Python 15 1 Updated Jul 17, 2024

Official PyTorch implementation of ECCV 2024 Paper: ControlNet++: Improving Conditional Controls with Efficient Consistency Feedback.

Python 456 19 Updated Dec 9, 2024

Code for "GVHMR: World-Grounded Human Motion Recovery via Gravity-View Coordinates", Siggraph Asia 2024

Jupyter Notebook 515 32 Updated Sep 19, 2024

Official implementation of "MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling"

1,384 55 Updated Nov 26, 2024

Generate high-definition short videos with one click using large AI models.

Python 19,220 2,935 Updated Dec 12, 2024

Official implementation of "ViewCrafter: Taming Video Diffusion Models for High-fidelity Novel View Synthesis"

Python 1,096 39 Updated Nov 6, 2024

HaMeR: Reconstructing Hands in 3D with Transformers

Python 474 47 Updated Oct 28, 2024

[TPAMI'23] Unifying Flow, Stereo and Depth Estimation

Python 1,149 118 Updated May 10, 2024

[ECCV 2024] HumanRefiner: Benchmarking Abnormal Human Generation and Refining with Coarse-to-fine Pose-Reversible Guidance

43 1 Updated Oct 8, 2024

A collection of awesome video generation studies.

TeX 405 15 Updated Jan 1, 2025

Code of Pyramidal Flow Matching for Efficient Video Generative Modeling

Python 2,641 262 Updated Dec 21, 2024

The official Meta Llama 3 GitHub site

Python 27,733 3,171 Updated Aug 12, 2024

📹 A more flexible CogVideoX that can generate videos at any resolution and creates videos from images.

Python 585 40 Updated Dec 16, 2024
Python 864 121 Updated Dec 11, 2024

High-resolution models for human tasks.

Python 4,704 270 Updated Nov 18, 2024

AI-based tool for removing hard-coded subtitles and text-like watermarks from images or videos, producing output files at the original resolution. Runs locally; no third-party API is required.

Python 4,938 653 Updated Dec 24, 2024

Inpaint Anything performs stable diffusion inpainting on a browser UI using masks from Segment Anything.

Python 243 30 Updated Dec 28, 2024

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 13,405 1,289 Updated Dec 25, 2024

Kolors Team

Python 4,053 299 Updated Nov 13, 2024

[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation

Python 7,168 548 Updated Jul 17, 2024

The Data and Code of Prompt2Sign: A Comprehensive Multilingual Sign Language Dataset.

Python 151 9 Updated Nov 25, 2024

Effortless data labeling with AI support from Segment Anything and other awesome models.

Python 4,565 514 Updated Jan 2, 2025
Python 72 2 Updated Jul 8, 2024

MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising

Python 2,544 269 Updated Jun 28, 2024

This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to it.

Python 11,822 1,040 Updated Dec 31, 2024

Open-Sora: Democratizing Efficient Video Production for All

Python 22,935 2,257 Updated Dec 27, 2024