April 4: The preprocessed dataset has been released; please see the Data preparation section. Some missing files have also been uploaded.
git clone git@github.com:theEricMa/OTAvatar.git
cd OTAvatar
conda env create -f environment.yml
conda activate otavatar
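As a quick, optional sanity check that the environment resolved correctly, you can confirm PyTorch sees your GPU. This is only a sketch and assumes environment.yml installs a CUDA build of PyTorch:

# Optional: verify PyTorch and CUDA are usable in the new environment.
import torch

print("PyTorch:", torch.__version__)
print("CUDA available:", torch.cuda.is_available())
if torch.cuda.is_available():
    print("GPU:", torch.cuda.get_device_name(0))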
Create a pretrained folder under the root directory.
Download the EG3D FFHQ model from the official webpage and copy it to the pretrained directory. Choose the model named ffhqrebalanced512-64.pkl.
Download arcface_resnet18.pth and save it to the pretrained directory.
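After both downloads, the pretrained folder should contain the two files named above. The following is a small optional check; the file names are taken from this README, so adjust them if yours differ:

# Optional: confirm the pretrained/ folder is laid out as the steps above expect.
from pathlib import Path

expected = [
    Path("pretrained/ffhqrebalanced512-64.pkl"),  # EG3D FFHQ checkpoint
    Path("pretrained/arcface_resnet18.pth"),      # ArcFace identity network
]
for f in expected:
    print(f, "ok" if f.is_file() else "MISSING")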
We have uploaded the processed dataset to Google Drive and Baidu Netdisk (password: CBSR). Then, in the root directory, run:
mkdir datasets
mv <your hdtf_lmdb_inv path> datasets/
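The folder name suggests the processed data is stored as an LMDB database. If so, a quick read-only open confirms the copy succeeded; this sketch assumes the lmdb Python package and that the data ends up at datasets/hdtf_lmdb_inv:

# Optional: open the copied dataset read-only and print basic stats.
# Assumes the data is an LMDB database, as the folder name suggests.
import lmdb

env = lmdb.open("datasets/hdtf_lmdb_inv", readonly=True, lock=False)
with env.begin() as txn:
    print("entries:", txn.stat()["entries"])
env.close()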
In general, the processing scripts are a mixture of those in PIRenderer and ADNeRF. We plan to open a new repository to release our revised preprocessing scripts.
Create the folder result/otavatar if it does not exist, and place the model (TODO) under this directory. Then run:
export CUDA_VISIBLE_DEVICES=0
python -m torch.distributed.launch --nproc_per_node=1 --master_port 12345 inference_refine_1D_cam.py \
--config ./config/otavatar.yaml \
--name config/otavatar.yaml \
--no_resume \
--which_iter 2000 \
--image_size 512 \
--ws_plus \
--cross_id \
--cross_id_target WRA_EricCantor_000 \
--output_dir ./result/otavatar/evaluation/cross_ws_plus_WRA_EricCantor_000
This animates each identity given the motion from WRA_EricCantor_000.
Or simply run:
sh scripts/inference.sh
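To drive the animation with more than one clip, one option is to wrap the command above in a small launcher. This is only a sketch, not part of the repository; the clip list is a placeholder and the flags simply mirror the inference command shown earlier.

# Sketch: re-run the cross-identity inference above for several driving clips.
import os
import subprocess

driving_clips = ["WRA_EricCantor_000"]  # replace/extend with the clips you need

for clip in driving_clips:
    cmd = [
        "python", "-m", "torch.distributed.launch",
        "--nproc_per_node=1", "--master_port", "12345",
        "inference_refine_1D_cam.py",
        "--config", "./config/otavatar.yaml",
        "--name", "config/otavatar.yaml",
        "--no_resume",
        "--which_iter", "2000",
        "--image_size", "512",
        "--ws_plus",
        "--cross_id",
        "--cross_id_target", clip,
        "--output_dir", f"./result/otavatar/evaluation/cross_ws_plus_{clip}",
    ]
    subprocess.run(cmd, check=True, env={**os.environ, "CUDA_VISIBLE_DEVICES": "0"})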
To train, run:
export CUDA_VISIBLE_DEVICES=0,1,2,3
python -m torch.distributed.launch --nproc_per_node=4 --master_port 12346 train_inversion.py \
--config ./config/otavatar.yaml \
--name otavatar
Or simply run:
sh scripts/train.sh
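Before launching, it can help to confirm that the number of visible GPUs matches --nproc_per_node. A minimal check, assuming the 4-GPU setup shown above:

# Optional: verify enough GPUs are visible for the distributed launch.
import torch

nproc_per_node = 4  # keep in sync with the launch command / scripts/train.sh
visible = torch.cuda.device_count()
assert visible >= nproc_per_node, (
    f"Only {visible} GPU(s) visible; expected at least {nproc_per_node}. "
    "Adjust CUDA_VISIBLE_DEVICES or --nproc_per_node accordingly."
)
print(f"{visible} GPU(s) visible; ready to launch training.")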
We appreciate the models and code from EG3D, PIRenderer, StyleHEAT, and EG3D-projector.
If you find this work helpful, please cite:
@article{ma2023otavatar,
  title={OTAvatar: One-shot Talking Face Avatar with Controllable Tri-plane Rendering},
  author={Ma, Zhiyuan and Zhu, Xiangyu and Qi, Guojun and Lei, Zhen and Zhang, Lei},
  journal={arXiv preprint arXiv:2303.14662},
  year={2023}
}