Skip to content

Domain adaptation

Domain adaptation refers to machine learning techniques that adjust models trained in one domain to perform well in a different domain, typically when labeled data is scarce or unavailable in the target domain. In fake news detection, domain adaptation enables models to handle shifts in topic, source, platform, language, or temporal distribution without full retraining.

Approaches

  • Adversarial domain adaptation: Learn domain-invariant feature representations via minimax training, forcing a feature extractor to fool a domain classifier. Example: EANN.
  • Domain discovery: Unsupervised identification of distinct domains (e.g., clusters of topically similar records) in unlabeled data, enabling targeted training. Example: Silva et al. use propagation network community structure to discover domains.
  • Instance weighting: Reweight labeled source-domain data to match the target domain's distribution, reducing distribution shift penalties.
  • Fine-tuning: Pretrain on source domain(s), then fine-tune on small labeled target-domain sample.

Key papers

  • Wang et al. (2018) — EANN: Foundational work formulating fake news detection as domain adaptation; adversarial training to learn event-invariant features; achieves 71.5% accuracy on Twitter.

  • Silva et al. (2021): Extends adversarial approaches with explicit preservation of domain-specific knowledge (via domain-specific decoder) and cross-domain knowledge (via shared decoder); unsupervised domain discovery eliminates need for manual domain labels.

Connections