Skip to content

Adversarial training

Adversarial training is a defense strategy where models are trained on a mixture of clean and adversarial examples to improve robustness. By exposing models to perturbations and attacks during training, adversarial training can reduce the model's vulnerability to novel attacks and improve generalization.

Key papers