# PDPGD: Primal-Dual Proximal Gradient Descent Adversarial Attack

[Alexander Matyasko](https://github.com/aam-at), Lap-Pui Chau, [**PDPGD: Primal-Dual Proximal Gradient Descent Adversarial Attack**](https://arxiv.org/abs/2106.01538).

State-of-the-art deep neural networks are sensitive to small input perturbations. Since the discovery of this intriguing vulnerability, many defence methods have been proposed that attempt to improve robustness to adversarial noise. Fast and accurate attacks are required to compare various defence methods. However, evaluating adversarial robustness has proven to be extremely challenging. Existing norm-minimisation adversarial attacks require thousands of iterations (e.g. the Carlini & Wagner attack), are limited to specific norms (e.g. Fast Adaptive Boundary), or produce sub-optimal results (e.g. the Brendel & Bethge attack). On the other hand, the PGD attack, which is fast, general and accurate, ignores the norm-minimisation penalty and solves a simpler perturbation-constrained problem. In this work, we introduce a fast, general and accurate adversarial attack that optimises the original non-convex constrained minimisation problem. We interpret optimising the Lagrangian of the adversarial attack optimisation problem as a two-player game: the first player minimises the Lagrangian wrt the adversarial noise; the second player maximises the Lagrangian wrt the regularisation penalty. Our attack algorithm simultaneously optimises primal and dual variables to find the minimal adversarial perturbation. In addition, for non-smooth lp-norm minimisation, such as the linf-, l1-, and l0-norms, we introduce a primal-dual proximal gradient descent attack. Our experiments show that our attack outperforms current state-of-the-art linf-, l2-, l1- and l0-attacks on the MNIST, CIFAR-10 and Restricted ImageNet datasets against unregularised and adversarially trained models.
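
Schematically, the problem being solved is the norm-minimisation attack below, handled through its Lagrangian. This is a simplified rendering of the abstract's description, with `g` standing in for some misclassification constraint; the paper's exact formulation may differ:

```latex
% smallest lp-norm perturbation that causes a misclassification
\min_{\delta} \; \lVert \delta \rVert_p
\quad \text{s.t.} \quad g(x + \delta) \le 0

% Lagrangian, optimised as a two-player game:
% the primal player minimises over delta,
% the dual player maximises over lambda >= 0
L(\delta, \lambda) = \lVert \delta \rVert_p + \lambda \, g(x + \delta)
```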

If you find this work useful, please consider citing:

```bibtex
@Article{matyasko2021pdpgd,
  author       = {Alexander Matyasko and Lap-Pui Chau},
  title        = {PDPGD: Primal-Dual Proximal Gradient Descent Adversarial Attack},
  journal      = {arXiv e-prints},
  year         = 2021,
  pages        = {arXiv:2106.01538},
  primaryclass = {cs.LG}
}
```
## Installation

Clone the repository and add the repository directory to `PYTHONPATH`. `git-lfs`
must be installed to automatically download the naturally trained and the linf-
and l2-adversarially trained models for MNIST and CIFAR-10. The models for the
experiments on Restricted ImageNet can be downloaded from the
[robust-features-code repository](https://github.com/MadryLab/robust-features-code).

```bash
git clone https://github.com/aam-at/cpgd
export PYTHONPATH="$PYTHONPATH:<PATH_TO_CLONE_DIR>"
```
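
If the LFS-tracked checkpoints were not fetched during the clone, they can be pulled explicitly with the standard `git-lfs` commands (this assumes `git-lfs` is already installed on your system):

```bash
cd cpgd
git lfs install   # register the LFS filters for the current user
git lfs pull      # download the model checkpoints tracked by LFS
```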

## Requirements (tested with Python 3.6)

- TensorFlow v2.2.0
- PyTorch v1.5.0
- Foolbox v3.2.1
- [Cleverhans](https://github.com/aam-at/cleverhans_tf2)
- Adversarial Robustness Toolbox v1.4.2

See `requirements.txt` for the full list of requirements.
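
A typical setup, assuming a fresh Python 3.6 environment, would install everything from that file:

```bash
pip install -r requirements.txt
```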

## Implementation details

Our attacks and utilities are implemented in the `lib` directory. `attack_lp`
implements the abstract classes `PrimalDualGradientAttack` and
`ProximalPrimalDualGradientAttack`, which correspond to our PDGD and PDPGD
attacks, respectively.
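
For intuition, here is a minimal sketch of one primal-dual proximal iteration in the spirit of the PDPGD update described in the abstract. This is not the repository's API: the function names, the choice of the l1 proximal operator (soft-thresholding), and the scalar constraint interface are all illustrative assumptions.

```python
import numpy as np


def soft_threshold(x, tau):
    """Proximal operator of tau * ||x||_1 (soft-thresholding)."""
    return np.sign(x) * np.maximum(np.abs(x) - tau, 0.0)


def pdpgd_step(delta, lam, grad_g, g_value, lr_primal=0.01, lr_dual=0.1):
    """One primal-dual proximal step for: min ||delta||_1  s.t.  g(x + delta) <= 0.

    Lagrangian: L(delta, lam) = ||delta||_1 + lam * g(x + delta).
    Primal player: gradient step on the smooth part lam * g, followed by
    the prox of the non-smooth l1 norm. Dual player: projected ascent on lam.
    """
    # primal: descend lam * g(x + delta), then apply prox of lr_primal * ||.||_1
    delta = soft_threshold(delta - lr_primal * lam * grad_g, lr_primal)
    # dual: ascend on the constraint value; keep the multiplier non-negative
    lam = max(lam + lr_dual * g_value, 0.0)
    return delta, lam
```

In the actual code, the proximal operator presumably varies with the target norm, which is what the abstract `ProximalPrimalDualGradientAttack` class abstracts over.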

## Steps to reproduce the results

We provide scripts to reproduce all the results for all datasets, attacks and
models in the `experiments` folder.

For example, one can run all MNIST experiments for the DeepFool linf- and
l2-norm attacks using the following commands:

```bash
cd experiments
./mnist_deepfool.sh
```
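
Assuming the remaining experiment scripts follow the same `mnist_<attack>.sh` naming convention (an assumption based on the example above), all MNIST experiments could be launched in sequence:

```bash
cd experiments
for script in mnist_*.sh; do
    ./"$script"
done
```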

We also provide scripts that parse the results of all experiments and export
them as an Excel spreadsheet:

```bash
cd mnist
python parse_results.py
```

The commit also adds per-dataset scripts that aggregate the raw experiment
results; the CIFAR-10 variant of `parse_results.py` is shown below:

```python
from functools import partial

from lib.parse_results import output_excel, parse_results

# all results are read from and written to the CIFAR-10 results directory
base_dir = "../results_cifar10"
parse_results = partial(parse_results, base_dir=base_dir)
output_excel = partial(output_excel, base_dir=base_dir)

# linf ("li") attacks
for model_type in ["plain", "linf", "l2"]:
    df = parse_results(model_type, "li", "df", sort_column="li_corr")
    output_excel(df, model_type, "li", "df")

for model_type in ["plain", "linf", "l2"]:
    df = parse_results(model_type, "li", "bethge", sort_column="li_corr")
    output_excel(df, model_type, "li", "bethge")

# fixed-epsilon attacks (DAA, PGD) are grouped by the attack budget
for model_type in ["plain", "linf", "l2"]:
    df = parse_results(model_type, "li", "daa", sort_column="acc_adv", group_by="attack_eps")
    output_excel(df, model_type, "li", "daa", group_by="attack_eps")

for model_type in ["plain", "linf", "l2"]:
    df = parse_results(model_type, "li", "pgd", sort_column="acc_adv", group_by="attack_eps")
    output_excel(df, model_type, "li", "pgd", group_by="attack_eps")

for model_type in ["plain", "linf", "l2"]:
    df = parse_results(model_type, "li", "fab")
    output_excel(df, model_type, "li", "fab")

# our attack is grouped by the number of random restarts
for model_type in ["plain", "linf", "l2"]:
    df = parse_results(model_type, "li", "our_li", group_by="attack_loop_number_restarts")
    output_excel(df, model_type, "li", "our_li", group_by="attack_loop_number_restarts")

# l2 attacks
for model_type in ["plain", "linf", "l2"]:
    df = parse_results(model_type, "l2", "df")
    output_excel(df, model_type, "l2", "df")

for model_type in ["plain", "linf", "l2"]:
    df = parse_results(model_type, "l2", "cw")
    output_excel(df, model_type, "l2", "cw")

for model_type in ["plain", "linf", "l2"]:
    df = parse_results(model_type, "l2", "ddn")
    output_excel(df, model_type, "l2", "ddn")

for model_type in ["plain", "linf", "l2"]:
    df = parse_results(model_type, "l2", "bethge")
    output_excel(df, model_type, "l2", "bethge")

for model_type in ["plain", "linf", "l2"]:
    df = parse_results(model_type, "l2", "pgd", sort_column="acc_adv", group_by="attack_eps")
    output_excel(df, model_type, "l2", "pgd", group_by="attack_eps")

for model_type in ["plain", "linf", "l2"]:
    df = parse_results(model_type, "l2", "fab")
    output_excel(df, model_type, "l2", "fab")

for model_type in ["plain", "linf", "l2"]:
    df = parse_results(model_type, "l2", "our_l2", group_by="attack_loop_number_restarts")
    output_excel(df, model_type, "l2", "our_l2", group_by="attack_loop_number_restarts")

# l1 attacks
for model_type in ["plain", "linf", "l2"]:
    df = parse_results(model_type, "l1", "sparsefool")
    output_excel(df, model_type, "l1", "sparsefool")

for model_type in ["plain", "linf", "l2"]:
    df = parse_results(model_type, "l1", "ead")
    output_excel(df, model_type, "l1", "ead")

for model_type in ["plain", "linf", "l2"]:
    df = parse_results(model_type, "l1", "bethge")
    output_excel(df, model_type, "l1", "bethge")

for model_type in ["plain", "linf", "l2"]:
    df = parse_results(model_type, "l1", "pgd", sort_column="acc_adv", group_by="attack_eps")
    output_excel(df, model_type, "l1", "pgd", group_by="attack_eps")

for model_type in ["plain", "linf", "l2"]:
    df = parse_results(model_type, "l1", "fab")
    output_excel(df, model_type, "l1", "fab")

for model_type in ["plain", "linf", "l2"]:
    df = parse_results(model_type, "l1", "our_l1", group_by="attack_loop_number_restarts")
    output_excel(df, model_type, "l1", "our_l1", group_by="attack_loop_number_restarts")

# l0 attacks
# note: reuses the l1 SparseFool results, re-sorted by the l0 metric and
# exported under the l0 norm
for model_type in ["plain", "linf", "l2"]:
    df = parse_results(model_type, "l1", "sparsefool", sort_column="l0_corr")
    output_excel(df, model_type, "l0", "sparsefool")

for model_type in ["plain", "linf", "l2"]:
    df = parse_results(model_type, "l0", "jsma")
    output_excel(df, model_type, "l0", "jsma")

for model_type in ["plain", "linf", "l2"]:
    df = parse_results(model_type, "l0", "one_pixel")
    output_excel(df, model_type, "l0", "one_pixel")

for model_type in ["plain", "linf", "l2"]:
    df = parse_results(model_type, "l0", "bethge")
    output_excel(df, model_type, "l0", "bethge")

for model_type in ["plain", "linf", "l2"]:
    df = parse_results(model_type, "l0", "cornersearch")
    output_excel(df, model_type, "l0", "cornersearch")

for model_type in ["plain", "linf", "l2"]:
    df = parse_results(model_type, "l0", "pgd", sort_column="acc_adv", group_by="attack_eps")
    output_excel(df, model_type, "l0", "pgd", group_by="attack_eps")

for model_type in ["plain", "linf", "l2"]:
    df = parse_results(model_type, "l0", "our_l0", group_by="attack_loop_number_restarts")
    output_excel(df, model_type, "l0", "our_l0", group_by="attack_loop_number_restarts")
```
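
Like the MNIST example in the README, this script resolves `base_dir` relative to its own location, so it is presumably run from inside the dataset directory:

```bash
cd cifar10   # assumed directory name, by analogy with the mnist example
python parse_results.py
```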