Skip to content

Generative AI Misuse

Generative AI misuse refers to the deliberate use of generative AI tools and models by individuals and organizations to facilitate, augment, or execute actions that may cause downstream harm. This includes both exploitation of GenAI capabilities (e.g., creating deepfakes, generating fraudulent content, impersonating individuals) and technical attacks on GenAI systems themselves (e.g., prompt injection, model poisoning).

Scope

The landscape of GenAI misuse encompasses:

  • Capability exploitation: leveraging text, image, audio, and video generation to create synthetic content, impersonate individuals, falsify evidence, or scale harmful operations
  • System attacks: adversarial inputs, prompt injection, jailbreaking, model extraction, and data poisoning
  • Motivation-driven applications: opinion manipulation, monetization, fraud, harassment, and reach/advocacy
  • Actor diversity: from state-sponsored entities to private corporations to individual users, with varying technical sophistication

Early empirical evidence (2023–2024) suggests that most real-world misuse does not involve sophisticated technical attacks but rather exploitation of easily accessible GenAI capabilities for goals long predating GenAI (impersonation, forgery, scams). The democratization of GenAI tools has lowered barriers to entry, enabling a broader pool of actors to engage in misuse with minimal technical expertise.

Key papers