Synthetic media¶

Synthetic media refers to content (images, video, audio, or text) created or manipulated using generative artificial intelligence models rather than captured directly from reality. Synthetic media encompasses a broader category than deepfakes: it includes AI-generated text, images, audio, and video, as well as manipulated versions of existing media.

Categories of synthetic media¶

Image generation: - Text-to-image models (DALL-E, Midjourney, Stable Diffusion) generate photorealistic images from text prompts - Synthetic portraits of non-existent people (created by GANs) - AI art and creative imagery

Audio synthesis: - Text-to-speech (TTS) systems generating natural-sounding speech in any voice or language - Voice conversion and voice cloning creating synthetic speech that mimics a specific speaker - Music generation and sound design

Video synthesis: - Deepfakes and facial reenactment (see Deepfakes) - Text-to-video models generating realistic video from text descriptions - Neural video synthesis and super-resolution

Textual synthesis: - Large language models (GPT-3, GPT-4, LLaMA) generating coherent text indistinguishable from human writing - AI-written news articles, social media posts, and disinformation content

Why synthetic media matters to misinformation research¶

Democratization of content creation: Traditional barriers to creating convincing fake audio, video, or images required technical expertise. AI tools increasingly commodify this capability.
Speed and scale: Generative models can produce thousands or millions of variations of synthetic content at scale and speed humans cannot match.
Emotional authenticity: Synthetic media often preserves subtle emotional and behavioral cues that make it psychologically persuasive even when technically flawed.
Dual-use challenge: The same technology used for creative expression (art, entertainment, education) can be weaponized for misinformation, fraud, and harassment.
Attribution difficulty: Determining the origin of synthetic media and tracing it back to creators is technically and legally complex.

Psychological and epistemic implications¶

The proliferation of synthetic media threatens several epistemic foundations:

Visual evidence: When anyone can generate photorealistic images or video, visual content becomes insufficient as evidence of truth
Voice authentication: Synthetic audio undermines voice recognition as a form of identity verification
Media ecosystem trust: If the public cannot distinguish synthetic from authentic media, trust in media institutions erodes broadly
Reality perception: Chronic exposure to synthetic media may create "reality confusion" where people struggle to identify authentic events

Key papers in this wiki¶

Generative AI Misuse: A Taxonomy of Tactics and Insights from Real-World Data — empirical taxonomy of real-world GenAI misuse; documents pervasive use of synthetic media across image, audio, and video for falsification, impersonation, and content scaling at scale
The Creation and Detection of Deepfakes: A Survey — comprehensive survey covering creation and detection of synthetic facial media; foundational for understanding generative model architectures and detection methods across deepfake creation types
Rössler et al. (2019) — FaceForensics++: Learning to Detect Manipulated Facial Images — largest facial forgery dataset with four manipulation methods; comprehensive evaluation of detection approaches from stegananalysis to deep learning; human baseline study showing XceptionNet significantly outperforms humans
Dolhansky et al. (2020) — The DeepFake Detection Challenge (DFDC) Dataset — largest deepfake detection benchmark with 128,154 videos from 3,426 consenting actors; addresses ethical limitations of prior datasets; demonstrates that detection remains difficult despite scale and diversity
Fagni et al. (2020) — TweepFake: about detecting deepfake tweets — First public dataset of human vs. machine-generated tweets; benchmarks detection of synthetic text on social media; demonstrates that transformer-based fine-tuned models are effective but that modern generators (GPT-2) remain challenging to detect

Deepfakes — synthetic video/audio of specific people created with GANs
Disinformation — intentionally false information; synthetic media is one vector
Trust in institutions and communicators — synthetic media's primary risk is erosion of trust through authenticity confusion
Generated text detection — methods for detecting machine-generated text

Open challenges¶

How do we develop scalable detection methods that work across image, audio, and video modalities and don't require labeled training data for each new synthetic technique?
What are the most effective interventions for educating publics about synthetic media without creating blanket skepticism toward all media?
How do regulations balance restrictions on malicious synthetic media with protection for creative and legitimate uses?
What transparency and attribution standards should apply to AI-generated content in journalism, advertising, and public communication?