DisPose: Disentangling Pose Guidance for Controllable Human Image Animation

This repository is the official implementation of DisPose.

📖 Table of Contents

🎨 Gallery
🧙 Method Overview
🔧 Preparations
- Setup repository and conda environment
- Prepare model weights
💫 Inference
📣 Disclaimer
💞 Acknowledgements

🎨 Gallery

case1.mp4

case2.mp4

case3.mp4

case4.mp4

case5.mp4

🧙 Method Overview

We present DisPose to mine more generalizable and effective control signals without additional dense input, which disentangles the sparse skeleton pose in human image animation into motion field guidance and keypoint correspondence.

🔧 Preparations

Setup repository and conda environment

The code requires python>=3.10, as well as torch>=2.0.1 and torchvision>=0.15.2. Please follow the instructions here to install both PyTorch and TorchVision dependencies. The demo has been tested on CUDA version of 12.4.

conda create -n dispose python==3.10
conda activate dispose
pip install -r requirements.txt

Prepare model weights

Download the weights of DisPose and put DisPose.pth into ./pretrained_weights/.
Download the weights of other components and put them into ./pretrained_weights/:

Downlaod the weights of CMP and put it into ./mimicmotion/modules/cmp/experiments/semiauto_annot/resnet50_vip+mpii_liteflow/checkpoints

Finally, these weights should be organized in ./pretrained_weights/. as follows:

./pretrained_weights/
|-- MimicMotion_1-1.pth
|-- DisPose.pth
|-- dwpose
|   |-- dw-ll_ucoco_384.onnx
|   └── yolox_l.onnx
|-- stable-diffusion-v1-5
|-- stable-video-diffusion-img2vid-xt-1-1

💫 Inference

A sample configuration for testing is provided as test.yaml. You can also easily modify the various configurations according to your needs.

bash scripts/test.sh

Tips

If your GPU memory is limited, try set decode_chunk_size in test.yaml to 1.
If you want to enhance the quality of the generated video, you could try some post-processing such as face swapping (insightface) and frame interpolation (IFRNet).

📣 Disclaimer

This is official code of DisPose. All the copyrights of the demo images and videos are from community users. Feel free to contact us if you would like remove them.

💞 Acknowledgements

We sincerely appreciate the code release of the following projects: MimicMotion, Moore-AnimateAnyone, CMP.

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
assets/example_data		assets/example_data
configs		configs
mimicmotion		mimicmotion
scripts		scripts
.gitignore		.gitignore
README.md		README.md
inference_ctrl.py		inference_ctrl.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

DisPose: Disentangling Pose Guidance for Controllable Human Image Animation

🎨 Gallery

🧙 Method Overview

🔧 Preparations

Setup repository and conda environment

Prepare model weights

💫 Inference

Tips

📣 Disclaimer

💞 Acknowledgements

About

Releases

Packages

Contributors 2

Languages

lihxxx/DisPose

Folders and files

Latest commit

History

Repository files navigation

DisPose: Disentangling Pose Guidance for Controllable Human Image Animation

🎨 Gallery

🧙 Method Overview

🔧 Preparations

Setup repository and conda environment

Prepare model weights

💫 Inference

Tips

📣 Disclaimer

💞 Acknowledgements

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages