Fact-Checking Meets Fauxtography: Verifying Claims About Images¶

Authors: Dimitrina Zlatkova, Preslav Nakov, Ivan Koychev

Venue: arXiv, 2019 — arXiv:1908.11722

TL;DR¶

This paper introduces the problem of verifying claims made about images and creates a dataset of 1,233 image-claim pairs labeled as true or false. Using features from reverse image search (media source credibility, article titles), image metadata (Google tags), and claim-article similarity, a linear SVM classifier achieves 80% accuracy, substantially above the 50% baseline.

Contributions¶

Formulates the novel task of predicting factuality of image-claim pairs, distinct from manipulated image detection.
Releases two curated datasets: 838 examples from Snopes Fauxtography section (imbalanced, 197 true / 641 false) and 395 true examples from Reuters Pictures of the Year (2015–2018).
Explores three feature classes: image-based (Google tags, URL domains, media source credibility), claim-based (TF-IDF), and relationship-based (cosine and embedding similarity between claim and reverse image search results).
Demonstrates sizable improvements over majority baseline; top features are URL domains (89.7% AP on combined test set) and claim-article similarity metrics.

Method¶

The system uses reverse image search (Google Vision API) to retrieve web pages containing the image. For each image:

Image features: Google's automatic tags associated with the image; domain names and categories of URLs from reverse search results; factuality ratings of news sources (from Media Bias/Fact Check database), classified as True/False/Mixed media.

Claim features: TF-IDF vector representation of the claim text.

Relationship features: Cosine similarity and Universal Sentence Encoder embedding similarity between claim text and article titles from trustworthy media sources returned by reverse image search. Similarities are smoothed by averaging.

Classification uses a linear SVM trained separately for each feature group, then predictions are normalized via softmax and averaged to form the final decision.

Results¶

Snopes-only test: 63.2% accuracy, 73.0% average precision (AP) over 10 random folds with 50-example test sets.

Snopes + Reuters combined: 80.1% accuracy, 90.3% AP over balanced 100-example test sets.

Top-performing features: - URL domains: 78.6% accuracy (S) / 89.7% AP (S+R) - True media percentage: 74.6% accuracy (S+R) - Embedding similarity: 74.0% accuracy (S+R) - Claim text: 74.9% accuracy (S+R)

Weak features: false media percentage, false media titles (near-baseline performance).

On unseen data from Snopes Fauxtography (Feb–Apr 2019, 28 balanced examples), top-1 feature (URL domains) achieved 71.7% AP.

Connections¶

Related to Propaganda detection and Misinformation and fake news detection via claim verification framework.
Distinct from Image forensics — focuses on false claims about real images rather than detecting digital manipulation.
Uses reverse image search as a primary information source, similar to automated fact-checking methods that consult the Web.
Extends Multimodal fake news detection by addressing the image-claim relationship via web-based evidence.

Notes¶

Strengths: - Clearly articulates a gap: prior fact-checking work ignores image-related claims despite visual content's influence. - Well-motivated dataset combining Snopes (abundant false examples) and Reuters (high-quality true examples). - Thorough feature engineering across image, claim, and relationship dimensions; transparent about failure modes (image splice detection, EXIF metadata extraction). - Larger improvements on combined dataset (80% vs 63%) underscore the value of the Reuters data.

Weaknesses: - Class imbalance in Snopes (641 false vs 197 true) reflects real distribution but complicates evaluation; Reuters examples are longer and from a narrow domain (award-winning photography). - Reverse image search depends on Google's API; results are noisy and may exclude niche or newly uploaded images. - Feature engineering is heuristic; embedding similarity relies on sentence encoders trained on general text, not claim-specific semantics. - No comparison to FauxBuster (prior social-media-based approach) due to unavailable dataset. - Image forensic approaches (splice detection, error-level analysis) failed empirically, suggesting the task may be heavily skewed toward web-based textual features.

Open questions: - How does performance generalize to non-English claims or less mainstream image sources? - Can the method scale to real-time claim verification pipelines?