Skip to content

Privacy in AI systems

Privacy in AI systems concerns protecting sensitive information from unauthorized access, extraction, or inference. Key concerns include:

Membership inference attacks: Determining whether a specific data point was in the training set by analyzing model outputs.

Data extraction: Recovering training data or reconstructing private information from models through query-based or black-box attacks.

Model inversion: Reconstructing input data from model outputs (e.g., recovering images from embeddings).

Sensitive data leakage: Models may memorize and leak personally identifiable information during inference.

Privacy in multimodal models: Vision-language models and diffusion models can leak information about training images when manipulated adversarially.

Key papers