yeungchenwa

Work Hard to Enjoy Life

Zhenhua Yang yeungchenwa

Work Hard to Enjoy Life

Master Studnet in SCUT @HCIILAB. Intern in @IDEA-Research.

79 followers · 49 following

Achievements

x3 x2

Achievements

x3 x2

Highlights

Organizations

Starred repositories

deepseek-ai / DeepSeek-V3

Python 12,962 915 Updated Dec 31, 2024

KwaiVGI / LivePortrait

Bring portraits to life!

Python 13,481 1,443 Updated Jan 1, 2025

lllyasviel / IC-Light

More relighting!

Python 7,215 422 Updated Nov 28, 2024

PWenJay / GCA-HNG

Official Code for IJCV 2024 paper — Globally Correlation-Aware Hard Negative Generation

Python 12 Updated Dec 18, 2024

Genesis-Embodied-AI / Genesis

A generative world for general-purpose robotics & embodied AI learning.

Python 21,115 1,640 Updated Jan 2, 2025

yeungchenwa / HDR

[AAAI2025] Predicting the Original Appearance of Damaged Historical Documents

Python 54 6 Updated Dec 17, 2024

Tencent / HunyuanVideo

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python 6,920 524 Updated Dec 31, 2024

wangjiangshan0725 / RF-Solver-Edit

Taming FLUX for Image Inversion & Editing; OpenSora for Video Inversion & Editing! (Official implementation for Taming Rectified Flow for Inversion and Editing.)

Python 339 8 Updated Dec 16, 2024

lehduong / OneDiffusion

Official implementation of OneDiffusion paper

Python 557 19 Updated Dec 14, 2024

IDEA-Research / ChatRex

Code for ChatRex: Taming Multimodal LLM for Joint Perception and Understanding

Python 118 3 Updated Nov 28, 2024

ACAT-SCUT / CycleNet

[NeurIPS 2024 Spotlight] Official repository of the CycleNet paper: "CycleNet: Enhancing Time Series Forecasting through Modeling Periodic Patterns". This work is developed by the Lab of Professor …

Jupyter Notebook 114 13 Updated Dec 18, 2024

bytedance / 1d-tokenizer

This repo contains the code for 1D tokenizer and generator

Jupyter Notebook 617 29 Updated Nov 20, 2024

OpenGVLab / Diffree

Diffree: Text-Guided Shape Free Object Inpainting with Diffusion Model

Python 232 13 Updated Aug 6, 2024

Tencent / Hunyuan3D-1

Tencent Hunyuan3D-1.0: A Unified Framework for Text-to-3D and Image-to-3D Generation

Python 2,499 190 Updated Dec 26, 2024

Ucas-HaoranWei / GOT-OCR2.0

Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Python 6,462 565 Updated Dec 31, 2024

ostris / ai-toolkit

Various AI scripts. Mostly Stable Diffusion stuff.

Python 3,712 413 Updated Dec 31, 2024

lucidrains / MIMO-pytorch

Pytorch implementation of MIMO, Controllable Character Video Synthesis with Spatial Decomposed Modeling, from Alibaba Intelligence Group

Python 129 6 Updated Sep 30, 2024

black-forest-labs / flux

Official inference repo for FLUX.1 models

Python 18,890 1,336 Updated Dec 31, 2024

TencentARC / MasaCtrl

[ICCV 2023] Consistent Image Synthesis and Editing

Python 754 30 Updated Aug 19, 2024

menyifang / MIMO

Official implementation of "MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling"

1,386 55 Updated Nov 26, 2024

Adamdad / kat

Kolmogorov-Arnold Transformer: A PyTorch Implementation with CUDA kernel

Python 647 36 Updated Oct 8, 2024

Gy920 / segment-anything-2-real-time

Run Segment Anything Model 2 on a live video stream

Jupyter Notebook 245 43 Updated Dec 13, 2024

QwenLM / Qwen2-VL

Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Python 4,024 242 Updated Dec 4, 2024

Jeff-LiangF / streamv2v

Official Pytorch implementation of StreamV2V.

Python 458 52 Updated Dec 24, 2024

showlab / Show-o

Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.

Python 1,116 46 Updated Dec 26, 2024

bghira / SimpleTuner

A general fine-tuning kit geared toward diffusion models.

Python 1,952 188 Updated Dec 29, 2024

Yuliang-Liu / Open-Oracle

AI-assisted Deciphering Oracle Bone Script

42 Updated Sep 15, 2024

H-Freax / Awesome-Video-Robotic-Papers

This repository compiles a list of papers related to the application of video technology in the field of robotics! Star⭐ the repo and follow me if you like what you see🤩.

133 6 Updated Aug 12, 2024

BradyFU / Video-MME

✨✨Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis

432 18 Updated Dec 14, 2024

lss-1138 / SparseTSF

[ICML 2024 Oral] Official repository of the SparseTSF paper: "SparseTSF: Modeling Long-term Time Series Forecasting with 1k Parameters". This work is developed by the Lab of Professor Weiwei Lin (l…

Python 166 16 Updated Dec 8, 2024

Tensorflow

Python

Markdown

Machine learning

Linux

GitHub API

Git

Docker

Deep learning

Data structures

See all starred topics