Statistical adversarial data detection (SADD) detects whether an upcoming batch contains adversarial examples (AEs) by measuring the distributional discrepancies between clean examples (CEs) and AEs. In this paper, we reveal the potential strength of SADD-based methods by theoretically showing that minimizing distributional discrepancy can help reduce the expected loss on AEs. Nevertheless, despite these advantages, SADD-based methods have a potential limitation: they discard inputs detected as AEs, leading to the loss of clean information within those inputs. To address this limitation, we propose a two-pronged adversarial defense method, named Distributional-Discrepancy-based Adversarial Defense (DDAD). In the training phase, DDAD first optimizes the test power of the maximum mean discrepancy (MMD) to derive MMD-OPT, and then trains a denoiser by minimizing the MMD-OPT between CEs and AEs. In the inference phase, DDAD first leverages MMD-OPT to differentiate CEs and AEs, and then applies a two-pronged process: (1) directly feeding the detected CEs into the classifier, and (2) removing noise from the detected AEs by the distributional-discrepancy-based denoiser. Extensive experiments show that DDAD outperforms current state-of-the-art (SOTA) defense methods by notably improving clean and robust accuracy on CIFAR-10 and ImageNet-1K against adaptive white-box attacks.
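The detection statistic above is based on the maximum mean discrepancy (MMD). As a toy illustration only (this is the plain unbiased MMD estimator with a Gaussian kernel on scalar inputs, not the repo's optimized MMD-OPT), the statistic can be sketched as:

```python
import math

def gaussian_kernel(x, y, sigma=1.0):
    """RBF kernel on scalar inputs."""
    return math.exp(-(x - y) ** 2 / (2 * sigma ** 2))

def mmd_squared(xs, ys, sigma=1.0):
    """Unbiased estimate of squared MMD between two scalar samples."""
    m, n = len(xs), len(ys)
    k_xx = sum(gaussian_kernel(a, b, sigma) for i, a in enumerate(xs)
               for j, b in enumerate(xs) if i != j) / (m * (m - 1))
    k_yy = sum(gaussian_kernel(a, b, sigma) for i, a in enumerate(ys)
               for j, b in enumerate(ys) if i != j) / (n * (n - 1))
    k_xy = sum(gaussian_kernel(a, b, sigma) for a in xs for b in ys) / (m * n)
    return k_xx + k_yy - 2 * k_xy

clean = [0.0, 0.1, -0.1, 0.05, -0.05]
shifted = [2.0, 2.1, 1.9, 2.05, 1.95]
print(mmd_squared(clean, clean[::-1]))  # near zero (slightly negative for the unbiased estimator)
print(mmd_squared(clean, shifted))      # clearly positive: the distributions differ
```

Batches drawn from the same distribution give a statistic near zero, while a distributional shift (as adversarial perturbations induce) gives a clearly larger value, which is what makes MMD usable as a detection criterion.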
The code is tested with Python 3.9. To install the required packages, run:

```shell
pip install -r requirements.txt
```
The checkpoint of pre-trained classifiers on CIFAR-10 should be put in `checkpoint/CIFAR10/[your model name]`. For example, the checkpoint of a pre-trained WideResNet-28-10 on CIFAR-10 should be put in `checkpoint/CIFAR10/WRN28`.
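For example, the expected directory for the WideResNet-28-10 checkpoint can be created with (the checkpoint filename below is a placeholder, not a file shipped with the repo):

```shell
# create the expected checkpoint directory for WideResNet-28-10
mkdir -p checkpoint/CIFAR10/WRN28
# then copy your checkpoint file into it, e.g.:
# cp your_wrn28_checkpoint.pt checkpoint/CIFAR10/WRN28/
```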
- The training recipe of ResNet and WideResNet on CIFAR-10 follows the GitHub repository below.
- To train ResNet and WideResNet:
```shell
git clone https://github.com/meliketoy/wide-resnet.pytorch.git

# train a WRN-28-10
python3 main.py --lr 0.1 --net_type 'wide-resnet' --depth 28 --widen_factor 10 --dataset 'cifar10'
# train a WRN-70-16
python3 main.py --lr 0.1 --net_type 'wide-resnet' --depth 70 --widen_factor 16 --dataset 'cifar10'
# train a RN-18
python3 main.py --lr 0.1 --net_type 'resnet' --depth 18 --dataset 'cifar10'
# train a RN-50
python3 main.py --lr 0.1 --net_type 'resnet' --depth 50 --dataset 'cifar10'
```
- The training recipe of Swin-Transformer on CIFAR-10 follows the GitHub repository below.
- To train a Swin-Transformer:
```shell
git clone https://github.com/kentaroy47/vision-transformers-cifar10.git
python train_cifar10.py --net swin --n_epochs 400
```
- The pre-trained ResNet-50 on ImageNet-1K follows the PyTorch implementation with `ResNet50_Weights.IMAGENET1K_V2`.
- Generate training data for MMD-OPT and the denoiser:

```shell
cd dataset
python3 cifar10.py
```
- Train DDAD:

```shell
# train DDAD on WRN-28-10
python3 train.py --data 'CIFAR10' --model 'wrn28' --epochs 60
# train DDAD on WRN-70-16
python3 train.py --data 'CIFAR10' --model 'wrn70' --epochs 60
```
- Generate training data for MMD-OPT and the denoiser:

```shell
cd dataset
python3 imagenet.py
```
- Generate adversarial data for training MMD-OPT and the denoiser:

```shell
python3 adv_generator.py --mode 'train' --data 'ImageNet' --model 'rn50' --attack 'mma'
```
- Train DDAD:

```shell
python3 train.py --data 'ImageNet' --model 'rn50' --epochs 60
```
- Evaluate DDAD against adaptive white-box attacks:

```shell
python3 adaptive_whitebox_attack.py
```
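At inference time, DDAD applies the two-pronged process described in the abstract. As a toy control-flow sketch only (every name below is an illustrative stand-in, not the repo's API), the routing looks like:

```python
# Toy sketch of DDAD's two-pronged inference routing (names are illustrative,
# not the repo's API): a discrepancy statistic decides whether a batch looks
# clean; suspected adversarial batches are denoised before classification.
def two_pronged_inference(batch, mmd_stat, threshold, classifier, denoiser):
    """Route a batch: clean batches go straight to the classifier,
    suspected adversarial batches pass through the denoiser first."""
    if mmd_stat(batch) <= threshold:        # looks clean
        return classifier(batch)
    return classifier(denoiser(batch))      # suspected adversarial

# Minimal stand-ins to exercise the control flow:
stat = lambda b: max(b) - min(b)            # pretend discrepancy statistic
clf = lambda b: sum(b)                      # pretend classifier
den = lambda b: [0.0 for _ in b]            # pretend denoiser

print(two_pronged_inference([0.0, 5.0], stat, 0.5, clf, den))  # → 0.0 (denoised path)
```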
- Generate transfer attacks:

```shell
# Take RN18 as an example
python3 adv_generator.py --mode 'test' --data 'CIFAR10' --model 'rn18' --attack 'eotpgd' --epsilon 8/255
python3 adv_generator.py --mode 'test' --data 'CIFAR10' --model 'rn18' --attack 'cw' --num-step 200 --epsilon 0.5
python3 adv_generator.py --mode 'test' --data 'CIFAR10' --model 'rn18' --attack 'eotpgd' --epsilon 12/255
python3 adv_generator.py --mode 'test' --data 'CIFAR10' --model 'rn18' --attack 'cw' --num-step 200 --epsilon 1.0
```
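The `--epsilon` flag above bounds the size of the adversarial perturbation. As a toy 1-D illustration of the projection that enforces this bound (PGD-style signed-gradient ascent with an epsilon-ball projection; this is not the repo's attack code, and the loss is an arbitrary example):

```python
# Toy 1-D PGD-style loop illustrating the epsilon-ball projection bounded by
# the --epsilon flag (illustrative only, not the repo's attack implementation).
def pgd_1d(x0, grad, epsilon, step, n_steps):
    """Maximize a loss via signed-gradient steps, projecting back into
    [x0 - epsilon, x0 + epsilon] after every update."""
    x = x0
    for _ in range(n_steps):
        x = x + step * (1 if grad(x) > 0 else -1)    # signed-gradient ascent
        x = max(x0 - epsilon, min(x0 + epsilon, x))  # project into the eps-ball
    return x

eps = 8 / 255
# example loss L(x) = (x - 1)^2 with gradient 2(x - 1); ascent pushes x away from 1
adv = pgd_1d(0.5, lambda x: 2 * (x - 1), eps, step=eps / 4, n_steps=20)
print(abs(adv - 0.5) <= eps + 1e-12)  # the final perturbation stays within epsilon
```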
- Evaluate the performance against transfer attacks:

```shell
# Take RN18 as an example
python3 transfer_attack.py --model 'rn18' --epsilon 8/255
python3 transfer_attack.py --model 'rn18' --epsilon 12/255
```
The implementation and evaluation of BPDA+EOT strictly follow the original paper and its GitHub implementation.
- This README is formatted based on the NeurIPS guidelines.
- Feel free to post any issues via GitHub.