Skip to content

Computational social science and large-scale text analysis

Computational social science leverages large digitized datasets and machine learning to answer social questions at scales impractical for traditional qualitative or hand-coded methods. For misinformation research, this includes: Structural Topic Modeling to identify discourse themes from massive text corpora, social network analysis to map organizational influence and information flow, and temporal analysis of how messaging evolves in response to events or funding.

Key advantages: Reproducibility via automated coding, ability to detect patterns missed by human analysts, and capacity to examine entire populations (e.g., all tweets, all press releases) rather than samples. Limitations include need for human interpretation of machine-discovered patterns and potential encoding of biases present in training data.

Key papers