Skip to content

Face tampering detection

Face tampering detection focuses on identifying images where a person's face has been manipulated, spliced from another image, or swapped with another identity. This includes both face swapping (replacing one face with another using morphing algorithms) and face splicing (cutting and pasting a face region between images).

Key distinguishing cues

Visual artifacts: Face swapping and splicing operations often leave visual artifacts including: - Stitching artifacts and boundary discontinuities - Unnatural edges (especially around lips and facial hair) - Blurring or smoothing near boundaries to blend the tampered region - Illumination inconsistencies

Low-level signal inconsistencies: - Camera Color Filter Array (CFA) pattern differences between the swapped face and surrounding regions - Local noise residual mismatches due to in-camera processing - Double JPEG compression artifacts (background compressed twice, tampered region once)

Physiological constraints: - Head pose inconsistencies between face landmarks and anatomical impossibilities - Eye gaze direction mismatches - Facial expression incompatibilities with body position

Approaches

Steganalysis-based: Extract low-level features originally designed for steganography detection (e.g., CFA-aware features, noise residuals) and apply SVM or other classifiers. These capture camera characteristics and in-camera processing artifacts.

CNN-based: Fine-tune deep neural networks (e.g., GoogLeNet, ResNet) on tampering datasets to learn high-level visual artifacts. Often combined with preprocessing such as Error Level Analysis.

Multi-stream fusion: Combine high-level visual artifact detection with low-level noise analysis in a single model to improve robustness to post-processing.

Forensic features: Use domain knowledge of image forensics (CFA patterns, JPEG artifacts, illumination models) as hand-crafted or CNN-extracted features.

Key papers

Datasets

Challenges

  • Post-processing robustness: High-quality face tampering includes boundary blurring, blending, and resizing, which remove obvious visual artifacts and challenge forensic techniques.
  • Dataset bias: Methods trained on one face-swapping algorithm may overfit to its specific artifacts and fail on others.
  • Computational cost: Per-image SVM training or feature extraction can be expensive at scale.
  • Small faces: Upsampling artifacts and limited patch coverage make detection difficult for small tampered faces (< 50×50 pixels).

Connections