Skip to content

Deepfakes

Deepfakes are synthetic videos or audio produced using machine learning techniques (primarily generative adversarial networks, or GANs) that convincingly depict a person saying or doing something they did not actually say or do. The term combines "deep learning" and "fake," reflecting the AI technology underlying their creation.

Deepfakes represent a new frontier in misinformation and video-based disinformation because they exploit the psychological power of visual evidence—people are more likely to believe video than text or still images—while removing the traditional requirement that misinformation creators have physical access to subjects or sophisticated video production resources.

Technical mechanisms

Generative Adversarial Networks (GANs): Two neural networks compete—a generator creates synthetic video frames, while a discriminator tries to distinguish them from real ones. This adversarial loop produces increasingly realistic results.

Face-swap techniques: Identity of one person is transferred onto the face of another in video, preserving lip-sync and facial expressions.

Voice synthesis: Deepfake audio uses neural text-to-speech or voice-conversion models to generate realistic speech that matches a speaker's acoustic patterns.

Self-reenactment: A speaker's facial expressions and head movements from one video are transferred to another video, enabling realistic video editing without identity replacement.

Why deepfakes matter to misinformation research

  1. Lower barriers to creation: Unlike traditional video forgery (which required expertise in video editing), deepfakes can be created with limited technical skill using publicly available tools and training datasets.

  2. Visual persuasion: Video is inherently more persuasive than text or images. The saying "seeing is believing" reflects people's tendency to treat video as direct evidence of truth. This makes deepfake video particularly dangerous.

  3. Realism heuristic: People judge credibility partly on how realistic content appears. Deepfakes that exploit this by depicting already-known public figures (whose appearance is familiar) are more believable than deepfakes of unknown people.

  4. Uncertainty amplification: Even when not fully believed, deepfakes create uncertainty ("Did this really happen?"), which erodes trust in institutions and media (see Vaccari & Chadwick (2020)).

  5. Political weaponization potential: Deepfakes of political leaders could spread during elections, creating uncertainty about authentic statements and damaging trust in democratic discourse.

Types of deepfakes

  • Political deepfakes: Synthetic videos of politicians saying compromising statements; the Obama/Peele deepfake circulated widely on social media in 2018
  • Non-consensual intimate deepfakes: Synthetic sexual imagery of real people, predominantly targeting women; a form of harassment and abuse
  • Fraudulent deepfakes: Voice cloning and video synthesis for financial fraud (e.g., deepfake audio of a CEO authorizing a wire transfer)
  • Manipulated media: Realistic but subtle edits (lip-sync manipulation, expression transfer) that fall short of full deepfakes but still mislead

Detectability and detection methods

Humans are generally poor at detecting deepfakes by eye. Research shows: - Wang et al. found that humans correctly identify deepfakes only ~50% of the time—statistically indistinguishable from random guessing - Compression artifacts, eye blinks, facial symmetry, and audio-visual asynchrony can indicate deepfakes, but these signals are increasingly subtle in newer generation models

Automated detection methods include: - Frequency-domain analysis: Deepfakes often exhibit artifacts in Fourier space due to GAN compression - Behavioral inconsistencies: Unnatural eye movement, facial expression sequences, or head pose trajectories - Audio-visual synchronization: Lip-sync mismatches or timing inconsistencies between mouth and speech - Forensic analysis: Camera noise patterns, sensor artifacts, or lighting inconsistencies

However, detection is an "arms race": as detection techniques improve, generation techniques advance to evade them.

Key resources

  • Jevin West — Misinformation and Data Literacy — discusses emerging synthetic-media threats (photoshopping, voice synthesis, deepfakes) and their role in escalating misinformation landscapes; notes public literacy lags behind technical sophistication

Key papers in this wiki

Open challenges

  • How do we scale detection methods to social media platforms operating at billions of videos per day?
  • What interventions most effectively reduce both belief in and uncertainty about deepfakes?
  • How do deepfakes interact with existing political polarization and partisan trust asymmetries?
  • What are the long-term effects of deepfake exposure on institutional trust and civic participation?
  • How do non-English-language deepfakes spread differently than English-language ones?