Skip to content

AI-Generated Text Detection

Detection of machine-generated text from large language models and other AI systems has become increasingly important as generative models improve. The challenge involves distinguishing between human-authored and AI-authored text to prevent misuse including plagiarism, disinformation, and fraud.

Detection approaches include watermarking (embedding hidden patterns during generation), neural network-based classifiers trained on human vs. AI text, zero-shot methods based on statistical properties, and retrieval-based systems. However, adversarial attacks on these detectors—through paraphrasing, prompt engineering, and other evasion techniques—demonstrate fundamental limitations in reliable detection.

Key papers