StyleID: Identity Disentanglement for Anonymizing Faces
Minh-Ha Le and Niklas Carlsson
Abstract: The privacy of machine learning models is one of the remaining challenges that hinder the broad adoption of Artificial Intelligence (AI). This paper considers this problem in the context of image datasets containing faces. Anonymization of such datasets is becoming increasingly important due to their central role in the training of autonomous cars, for example, and the vast amount of data generated by surveillance systems. While most prior work de-identifies facial images by modifying identity features in pixel space, we instead project the image onto the latent space of a Generative Adversarial Network (GAN) model, find the features that provide the biggest identity disentanglement, and then manipulate these features in latent space, pixel space, or both. The main contribution of the paper is the design of a feature-preserving anonymization framework, StyleID, which protects the individuals’ identity, while preserving as many characteristics of the original faces in the image dataset as possible. As part of the contribution, we present a novel disentanglement metric, three complementing disentanglement methods, and new insights into identity disentanglement. StyleID provides tunable privacy, has low computational complexity, and is shown to outperform current state-of-the-art solutions.
This is the official implementation of StyleID, a framework to disentangle and anonymize identity in facial images. The framework uses the highly disentangled latent space of a pre-trained StyleGAN generator. Three methods of disentanglement for anonymizing faces are presented (a minimal sketch of the latent-space idea follows the list):
- (1) Disentanglement in latent space
- (2) Disentanglement in pixel space
- (3) Latent swapper
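As a minimal sketch of the latent-space idea behind method (1): given W+ latent codes for a source and a target face, the style channels ranked as most identity-salient (the single-channel ranks downloaded below) are copied from the target into the source. The function and variable names here are illustrative, not the framework's actual API:

```python
import torch

def swap_identity_channels(w_src, w_tgt, channel_ranks, k=512):
    """Copy the k most identity-salient style channels from the target
    latent code into the source latent code.

    w_src, w_tgt: (n_latent, 512) W+ codes; channel_ranks: flat indices
    into the n_latent * 512 style channels, most identity-salient first.
    The default k is illustrative only.
    """
    w_anon = w_src.flatten().clone()
    idx = channel_ranks[:k]                 # most identity-salient channels
    w_anon[idx] = w_tgt.flatten()[idx]      # take those channels from the target
    return w_anon.view_as(w_src)
```

Swapping more channels (a larger k) suppresses more of the identity at the cost of preserving fewer attributes of the original face, which is the tunable-privacy knob mentioned in the abstract.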
The face generator is Rosinality's PyTorch implementation of StyleGAN2. The model has been modified to fit our framework.
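For reference, loading a pre-trained generator with Rosinality's interface typically looks like the sketch below; the import path, checkpoint name, and image size are assumptions, not guaranteed by this repo:

```python
import torch
from model import Generator  # Rosinality's StyleGAN2 Generator class

# Assumed settings for a 1024x1024 face generator; checkpoint path is hypothetical.
g_ema = Generator(size=1024, style_dim=512, n_mlp=8).to('cuda')
ckpt = torch.load('pretrained_models/stylegan2-ffhq-config-f.pt')
g_ema.load_state_dict(ckpt['g_ema'], strict=False)
g_ema.eval()
```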
To avoid the hassle of setting up an environment, a demo of our work can be run on Google Colab at this link:
This framework has been successfully tested on:
- Ubuntu 18.04
- PyTorch 1.7.1
- CUDA Toolkit 11.2.2
Hardware:
- CPU: Intel Core i7 (10th generation)
- GPU: Nvidia RTX 3090 (this is not a requirement; any GPU with at least 12 GB of VRAM should be enough)
- RAM: 32 GB DDR4
We recommend setting up the virtual environment with Anaconda. The commands below create the environment, install the gdown downloader (used to fetch pre-trained models from Google Drive), and install Ninja (needed to build StyleGAN2's custom CUDA ops):
conda env update -n styleid --file ./requirements.yaml
pip install --upgrade --no-cache-dir gdown
wget https://github.com/ninja-build/ninja/releases/download/v1.8.2/ninja-linux.zip
sudo unzip ninja-linux.zip -d /usr/local/bin/
sudo update-alternatives --install /usr/bin/ninja ninja /usr/local/bin/ninja 1 --force
- Install CUDA Toolkit 11.2.2
- Install PyTorch 1.7.1:
pip install torch==1.7.1+cu110 torchvision==0.8.2+cu110 torchaudio===0.7.2 -f https://download.pytorch.org/whl/torch_stable.html
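After installing, a quick sanity check (not part of the original setup steps) can confirm that PyTorch sees the GPU:

```python
import torch

print(torch.__version__)              # expect 1.7.1+cu110
print(torch.cuda.is_available())      # expect True
print(torch.cuda.get_device_name(0))  # e.g., your RTX 3090
```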
Pre-trained models will be downloaded by our provided script utils/download_files.py (or can be downloaded manually from the links in the table below).
python utils/download_files.py
Note that a few other files will be downloaded while the script is running (a loading sketch follows the list), including:
- Latent codes of faces in CelebA-HQ dataset
- Identity disentanglement rank for single channels
- Identity disentanglement rank for group of 256 channels
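For illustration, the downloaded latent codes and channel ranks might be loaded as follows; the file names are hypothetical (check utils/download_files.py for the actual paths):

```python
import torch

# Hypothetical file names; see utils/download_files.py for the real paths.
latents = torch.load('pretrained_models/celebahq_latents.pt')      # CelebA-HQ W+ codes
channel_ranks = torch.load('pretrained_models/id_rank_single.pt')  # single-channel identity ranks
```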
In addition, we provide various auxiliary models needed for training the mapper as well as pretrained models needed for computing the identity disentanglement metrics reported in the paper.
Path | Description |
---|---|
Segmentation to Image | pSp trained with the CelebAMask-HQ dataset for image synthesis from segmentation maps. |
IR-SE50 Model | Pretrained IR-SE50 model taken from TreB1eN for use in our ID loss during pSp training. |
CurricularFace Backbone | Pretrained CurricularFace model taken from HuangYG123 for use in ID similarity metric computation. |
MTCNN | Weights for MTCNN model taken from TreB1eN for use in ID similarity metric computation. (Unpack the tar.gz to extract the 3 model weights.) |
All of these models should be placed in the ./pretrained_models/ folder.
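As a simplified sketch of how an ID similarity metric can be computed with one of these backbones (e.g., the CurricularFace model above), assuming aligned face crops as input; the repo's exact preprocessing may differ:

```python
import torch
import torch.nn.functional as F

def id_similarity(backbone, img_a, img_b):
    """Cosine similarity between embeddings of two aligned face crops.

    `backbone` is a pretrained recognition model (e.g., CurricularFace);
    inputs are assumed to be aligned 112x112 crops in [-1, 1].
    """
    with torch.no_grad():
        emb_a = F.normalize(backbone(img_a), dim=-1)
        emb_b = F.normalize(backbone(img_b), dim=-1)
    return (emb_a * emb_b).sum(dim=-1)  # near 1 = same identity
```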
Given an input facial image, the identity can be anonymized by swapping it with a specific target or a randomly generated target. For simplicity of demonstration, we choose both the input face and the target face randomly.
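Putting the pieces together, a hedged end-to-end sketch of anonymizing with a randomly generated target, building on the illustrative helpers above and assuming Rosinality's generator interface (get_latent, n_latent); this is not the framework's actual API:

```python
import torch

# Illustrative: anonymize a source face by swapping identity channels
# with a random target (method 1), using the hypothetical helpers above.
with torch.no_grad():
    w_src = latents[0].to('cuda')                  # W+ code of the input face, (n_latent, 512)
    z = torch.randn(1, 512, device='cuda')         # random target identity
    w = g_ema.get_latent(z)                        # Z -> W via the mapping network
    w_tgt = w.unsqueeze(1).repeat(1, g_ema.n_latent, 1)[0]  # broadcast W to W+
    w_anon = swap_identity_channels(w_src, w_tgt, channel_ranks)
    img_anon, _ = g_ema([w_anon.unsqueeze(0)], input_is_latent=True)
```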
We highly recommend running our demo on Google Colab to avoid any issues related to setting up the environment. All three methods are demonstrated in notebook.ipynb.
Open Jupyter Lab in the StyleID folder:
jupyter lab
Navigate to and open notebook.ipynb. Please ignore the "Initial setup" and "Downloading models and data" sections, as they are meant for Google Colab only.
Directory | Description |
---|---|
configs | General configs for datasets, paths, and transforms used by the framework. |
criteria | Modules and classes for loss functions. |
datasets | Modules and classes to handle datasets. |
mapper | Specific options and data classes for the latent mapper. |
models | Models used by StyleID. |
pretrained_models | Weights of pre-trained models will be placed here. |
utils | Utilities used by StyleID. |
If you use our code, please cite our paper:
@inproceedings{le2023styleid,
title={StyleID: Identity Disentanglement for Anonymizing Faces},
author={Le, Minh-Ha and Carlsson, Niklas},
booktitle={Proceedings on Privacy Enhancing Technologies (PoPETs)},
year={2023}
}