Skip to content
View xiezheng-cs's full-sized avatar
😃
😃
  • SCUT-SMIL
  • ShenZhen, China

Block or report xiezheng-cs

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 5,750 942 Updated Feb 24, 2025

More relighting!

Python 7,592 465 Updated Feb 20, 2025

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python 8,760 711 Updated Feb 20, 2025

We present StableAnimator, the first end-to-end ID-preserving video diffusion framework, which synthesizes high-quality videos without any post-processing, conditioned on a reference image and a se…

Python 1,153 67 Updated Dec 30, 2024

Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance

Python 4,160 482 Updated Jul 10, 2024

The official implementation of RealisDance

C 305 18 Updated Nov 14, 2024
Python 781 96 Updated Dec 11, 2024

🦙 LaMa Image Inpainting, Resolution-robust Large Mask Inpainting with Fourier Convolutions, WACV 2022

Jupyter Notebook 8,438 897 Updated Feb 5, 2025

Inpaint anything using Segment Anything and inpainting models.

Jupyter Notebook 6,917 589 Updated Feb 29, 2024
Python 14 2 Updated Jul 14, 2024

ControlSpeech: Towards Simultaneous Zero-shot Speaker Cloning and Zero-shot Language Style Control With Decoupled Codec

Python 211 9 Updated Nov 22, 2024

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 11,078 1,088 Updated Feb 25, 2025

Multilingual Voice Understanding Model

Python 4,619 418 Updated Jan 8, 2025

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

Python 5,489 498 Updated Aug 10, 2024

A generative speech model for daily dialogue.

Python 34,705 3,738 Updated Feb 18, 2025

SOTA Open Source TTS

Python 19,509 1,510 Updated Feb 18, 2025

Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key

Python 7,469 713 Updated Feb 3, 2025

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 41,207 4,593 Updated Feb 24, 2025

unofficial inplementation of paper Underexposed Photo Enhancement using Deep Illumination Estimation(2019 CVPR)

Python 86 16 Updated Jun 10, 2021

Official implementation of OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on

Python 6,076 874 Updated May 13, 2024

Code for "Joint Denoising and Demosaicking with Green Channel Prior for Real-world Burst Images", TIP2021

Python 45 7 Updated Aug 9, 2021

The official PyTorch implementation of DeeDSR

18 Updated Mar 31, 2024

Code of "ResDiff: Combining CNN and Diffusion Model for Image Super-Resolution"

Python 69 5 Updated Mar 20, 2023

Diff tool for comparing Win32 resources in PE images

C++ 6 3 Updated Mar 4, 2020
Python 100 4 Updated Jan 8, 2025

End-to-End Learning for Joint Image Demosaicing, Denoising and Super-Resolution

Python 54 5 Updated Nov 28, 2024

This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.

Python 11,906 1,055 Updated Feb 19, 2025

Open-Sora: Democratizing Efficient Video Production for All

Python 23,411 2,316 Updated Feb 21, 2025

Webpage of Livenet technique

CSS 1 1 Updated Dec 20, 2023

[CVPR2024] SeeSR: Towards Semantics-Aware Real-World Image Super-Resolution

Python 492 35 Updated Oct 24, 2024
Next