Explainable AI¶

Explainable AI (XAI) refers to methods for making AI systems' decisions transparent and understandable to humans. In the context of fake news detection, explainability addresses a critical need: users and fact-checkers must understand why a system flagged content as misinformation to trust the system and potentially correct it.

Explainability approaches operate at different levels:

Local explanations: why a particular prediction was made for a specific input (e.g., which words or features drove a fake news classification)
Global explanations: what patterns the model generally uses to make decisions across all inputs
Rule-based explanations: decompositional or symbolic representations (decision trees, logic rules) that are human-readable by design
Post-hoc explanations: applying interpretability techniques (LIME, SHAP, attention weights) to already-trained "black-box" models

Key papers¶

TELLER: A Trustworthy Framework For Explainable, Generalizable and Controllable Fake News Detection — dual-system approach to explainability: cognition system decomposes claims into interpretable yes/no questions (driven by LLMs), decision system learns transparent logic rules via disjunctive normal form; enables human understanding of reasoning chain and human intervention via rule adjustment
[[2019-cui-claim-verification-explanations|Cui et al. (2019)]] — provides evidence and explanations for claim verification; uses feature importance and evidence-based reasoning
[[2023-liao-interpretable-misinformation]] — interpretable multimodal misinformation detection with logic reasoning

Connections¶

Trustworthy AI — explainability is one pillar of trustworthiness; others include robustness, fairness, and controllability
Neural-symbolic AI — neural-symbolic systems are naturally more interpretable than pure neural approaches
Fake news detection methods — many detection methods lack explainability; new work prioritizes transparent architectures