Skip to content

User profiles for fake news detection

User profile features are attributes derived from a news sharer's social media account — both metadata available directly from APIs (explicit features) and attributes inferred from behavioral signals such as tweet history or profile images (implicit features). The core hypothesis is that users who systematically spread fake news differ demographically and behaviorally from those who share real news, and that these differences are detectable at scale.

Key papers

Key concepts

  • Explicit features: directly available from API metadata — account age (RegisterTime), verified status, follower/following counts, post count, favorite count.
  • Implicit features: inferred from content and behavior — age, personality (Big Five), geolocation, profile image object type, political bias score.
  • Fake news Ratio (FR): \(FR(i) = n_i^{(f)} / (n_i^{(r)} + n_i^{(f)})\), a per-user score used to identify representative fake-news and real-news propagators.
  • UPF vector: news-level feature formed by averaging per-user feature vectors across all users who shared a given item.

Connections

  • Closely related to social-context detection; user profiles are one component of the social context.
  • Political bias is a particularly discriminative implicit feature.
  • Bot filtering (e.g., Botometer) is typically a prerequisite before profile-based analysis to avoid noise from automated accounts.