Skip to content

Explainable AI

Explainable AI (XAI) refers to methods for making AI systems' decisions transparent and understandable to humans. In the context of fake news detection, explainability addresses a critical need: users and fact-checkers must understand why a system flagged content as misinformation to trust the system and potentially correct it.

Explainability approaches operate at different levels:

  • Local explanations: why a particular prediction was made for a specific input (e.g., which words or features drove a fake news classification)
  • Global explanations: what patterns the model generally uses to make decisions across all inputs
  • Rule-based explanations: decompositional or symbolic representations (decision trees, logic rules) that are human-readable by design
  • Post-hoc explanations: applying interpretability techniques (LIME, SHAP, attention weights) to already-trained "black-box" models

Key papers

  • TELLER: A Trustworthy Framework For Explainable, Generalizable and Controllable Fake News Detection — dual-system approach to explainability: cognition system decomposes claims into interpretable yes/no questions (driven by LLMs), decision system learns transparent logic rules via disjunctive normal form; enables human understanding of reasoning chain and human intervention via rule adjustment
  • [[2019-cui-claim-verification-explanations|Cui et al. (2019)]] — provides evidence and explanations for claim verification; uses feature importance and evidence-based reasoning
  • [[2023-liao-interpretable-misinformation]] — interpretable multimodal misinformation detection with logic reasoning

Connections

  • Trustworthy AI — explainability is one pillar of trustworthiness; others include robustness, fairness, and controllability
  • Neural-symbolic AI — neural-symbolic systems are naturally more interpretable than pure neural approaches
  • Fake news detection methods — many detection methods lack explainability; new work prioritizes transparent architectures