Skip to content

Truthfulness

Truthfulness refers to the property of statements matching reality. In the context of AI, truthfulness standards aim to prevent AI systems from generating false claims, either accidentally (negligent falsehoods) or strategically (lies). This is distinct from honesty (whether a system's statements match its own beliefs) and from transparency or explainability.

Key papers