-
Fudan University
- Shanghai, China
-
06:06
(UTC +08:00) - https://scholar.google.com/citations?user=t88nyvsAAAAJ
Stars
HunyuanVideo: A Systematic Framework For Large Video Generation Model
EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation
Tencent Hunyuan3D-1.0: A Unified Framework for Text-to-3D and Image-to-3D Generation
Offical implement of Dynamic Frame Avatar with Non-autoregressive Diffusion Framework for talking head Video Generation
(ICCV2023) A large-pose Flickr face dataset comprised of 19,590 high-quality real large-pose portrait images.
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
Boost LaTeX typesetting efficiency with preview, compile, autocomplete, colorize, and more.
EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting
V-Express aims to generate a talking head video under the control of a reference image, an audio, and a sequence of V-Kps images.
State-of-the-art 2D and 3D Face Analysis Project
Code and dataset for photorealistic Codec Avatars driven from audio
Pytorch implementation of paper "One-Shot Free-View Neural Talking-Head Synthesis for Video Conferencing"
LivePortrait for AUTOMATIC1111 Stable Diffusion WebUI
SwissArmyTransformer is a flexible and powerful library to develop your own Transformer variants.
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
[ECCV 2024] Official implementation of the paper "X-Pose: Detecting Any Keypoints"
Official inference repo for FLUX.1 models
LivePortrait: Efficient Portrait Animation with Stitching and Retargeting Control