Skip to content

Synthetic media

Synthetic media refers to content (images, video, audio, or text) created or manipulated using generative artificial intelligence models rather than captured directly from reality. Synthetic media encompasses a broader category than deepfakes: it includes AI-generated text, images, audio, and video, as well as manipulated versions of existing media.

Categories of synthetic media

Image generation: - Text-to-image models (DALL-E, Midjourney, Stable Diffusion) generate photorealistic images from text prompts - Synthetic portraits of non-existent people (created by GANs) - AI art and creative imagery

Audio synthesis: - Text-to-speech (TTS) systems generating natural-sounding speech in any voice or language - Voice conversion and voice cloning creating synthetic speech that mimics a specific speaker - Music generation and sound design

Video synthesis: - Deepfakes and facial reenactment (see Deepfakes) - Text-to-video models generating realistic video from text descriptions - Neural video synthesis and super-resolution

Textual synthesis: - Large language models (GPT-3, GPT-4, LLaMA) generating coherent text indistinguishable from human writing - AI-written news articles, social media posts, and disinformation content

Why synthetic media matters to misinformation research

  1. Democratization of content creation: Traditional barriers to creating convincing fake audio, video, or images required technical expertise. AI tools increasingly commodify this capability.

  2. Speed and scale: Generative models can produce thousands or millions of variations of synthetic content at scale and speed humans cannot match.

  3. Emotional authenticity: Synthetic media often preserves subtle emotional and behavioral cues that make it psychologically persuasive even when technically flawed.

  4. Dual-use challenge: The same technology used for creative expression (art, entertainment, education) can be weaponized for misinformation, fraud, and harassment.

  5. Attribution difficulty: Determining the origin of synthetic media and tracing it back to creators is technically and legally complex.

Psychological and epistemic implications

The proliferation of synthetic media threatens several epistemic foundations:

  • Visual evidence: When anyone can generate photorealistic images or video, visual content becomes insufficient as evidence of truth
  • Voice authentication: Synthetic audio undermines voice recognition as a form of identity verification
  • Media ecosystem trust: If the public cannot distinguish synthetic from authentic media, trust in media institutions erodes broadly
  • Reality perception: Chronic exposure to synthetic media may create "reality confusion" where people struggle to identify authentic events

Key papers in this wiki

Open challenges

  • How do we develop scalable detection methods that work across image, audio, and video modalities and don't require labeled training data for each new synthetic technique?
  • What are the most effective interventions for educating publics about synthetic media without creating blanket skepticism toward all media?
  • How do regulations balance restrictions on malicious synthetic media with protection for creative and legitimate uses?
  • What transparency and attribution standards should apply to AI-generated content in journalism, advertising, and public communication?