Media profiling¶
Media profiling involves assessing the factuality and bias of entire news outlets rather than evaluating individual claims or articles. The key insight is that outlets with consistent records of publishing false or heavily biased content are likely to continue doing so, enabling rapid detection of potentially unreliable content the moment it is published by simply checking the source.
Rationale¶
Manual fact-checking of every suspicious claim is infeasible at scale. Viral misinformation spreads 6× faster than true news, with over 50% of sharing happening within the first ten minutes. Source-level profiling allows detection without waiting for evidence accumulation, making it especially valuable for time-sensitive contexts.
Approaches¶
Textual features: Linguistic markers (sentiment, hedging, subjectivity) computed over articles published by a medium; averaged embeddings from BERT or Sentence-BERT; posterior probability aggregation.
Multimedia analysis: Visual characteristics of images (deep learning representations, reverse image search provenance); video and audio forensics for deepfakes and manipulated media.
Audience homophily: Ideological profiles of followers on Twitter, Facebook, or YouTube; audience distribution across the political spectrum; self-descriptions of users following media accounts.
Infrastructure characteristics: Domain registration patterns, DNS/certificate metadata, web design features, shared data objects (scripts, images) across websites. Content-agnostic and useful for nascent outlets with limited published material.
Key papers¶
- Predicting Factuality of Reporting and Bias of News Media Sources: foundational work predicting media-level factuality and political bias from article text, Wikipedia, Twitter, URLs, and web traffic; releases 1,066-website dataset
- Multi-Task Ordinal Regression for Jointly Predicting the Trustworthiness and the Leading Political Ideology of News Media: ordinal regression framework jointly modeling outlet trustworthiness and political ideology; shows that extreme bias and low factuality are correlated
- What Was Written vs. Who Read It: News Media Profiling Using Text Analysis and Social Media Context: combines article text, social media audience features, and Wikipedia content for outlet-level bias and factuality prediction
- A Survey on Predicting the Factuality and the Bias of News Media: comprehensive survey covering textual, multimedia, audience, and infrastructure approaches
- A Survey on Multimodal Disinformation Detection: multimodal approaches to factuality and harmfulness, including source-level indicators
Related topics¶
- Source reliability: assessing whether a source is trustworthy
- Political bias in fake news detection: ideological leaning of media outlets
- Media bias detection: detecting presentation and selection bias in news coverage
- Fact-checking and corrections: claim-level verification that benefits from source credibility