Skip to content

Graph Neural Networks

Graph neural networks (GNNs) are neural network architectures that learn representations of graph-structured data by aggregating information from node neighborhoods. Unlike traditional neural networks designed for fixed-size inputs (images, sequences), GNNs operate on variable-size graphs where nodes can have different numbers of neighbors and connections.

Core concepts

Message passing: GNNs typically work by iteratively updating node representations by aggregating information from neighboring nodes (message passing). Different GNN variants (Graph Convolutional Networks, GraphSAGE, Graph Attention Networks) differ in their aggregation functions and update rules.

Graph structure preservation: GNNs aim to learn representations that preserve graph properties—nodes with similar neighborhood structures or roles should have similar representations. This is useful for both node-level tasks (classification, link prediction) and graph-level tasks (predicting properties of entire graphs).

Scalability challenge: A key research question is how to scale GNNs to very large graphs (millions/billions of nodes) where computing global representations requires careful subsampling and aggregation strategies.

Applications in cascade prediction and information diffusion

GNNs are particularly useful for cascade prediction because cascades are naturally represented as graphs (nodes = users, edges = information propagation). Rather than hand-engineering features of cascade graphs (clustering coefficients, centrality measures), GNNs can automatically learn which structural properties are most predictive.

DeepCas: an End-to-end Predictor of Information Cascades represents cascades as random walk paths through the cascade graph, processed through bidirectional GRU networks with attention—an early approach to end-to-end learning of cascade representations.

Key papers