Recurrent Neural Networks¶

Recurrent Neural Networks (RNNs) are a class of neural networks designed to process sequential data by maintaining and updating an internal hidden state at each time step. This architecture enables the network to capture dependencies and patterns across time or across the sequence.

Key architectures¶

Standard RNN: At each time step, the hidden state is computed from the previous hidden state and the current input via a non-linear activation function (typically tanh). Standard RNNs suffer from vanishing gradient problems when sequences are long, limiting their ability to learn long-range dependencies.

Long Short-Term Memory (LSTM): Introduced by Hochreiter & Schmidhuber (1997), LSTMs address the vanishing gradient problem through a gating mechanism: input gates, forget gates, output gates, and a memory cell. This architecture enables the network to selectively retain or discard information over long sequences.

Gated Recurrent Unit (GRU): Proposed by Cho et al. (2014), GRUs simplify LSTMs by using only update and reset gates (no separate memory cell), making them computationally cheaper while retaining the ability to learn long-range dependencies.

Bidirectional variants: In many NLP tasks, context from both directions (past and future) is valuable. Bidirectional RNNs process the sequence in both directions and concatenate the outputs, enabling the network to access full context.

Applications in misinformation and fake news detection¶

Text classification: RNNs capture sequential linguistic patterns in headlines and articles
Rumor detection: LSTMs model rumor propagation patterns in social media sequences
Stance detection: Bidirectional RNNs with attention capture dependencies between comment and target text
Clickbait detection: Character and word embeddings fed to RNNs learn patterns of misleading headlines

Key papers¶

We used Neural Networks to Detect Clickbaits: You won't believe what happened Next! — Bidirectional LSTM with word and character embeddings for clickbait detection
DeepCas: an End-to-end Predictor of Information Cascades — Bidirectional GRU with attention for information cascade prediction
CSI: A Hybrid Deep Model for Fake News Detection — LSTM-based hybrid model for fake news detection combining temporal and user patterns

Deep learning — broader framework
Neural networks — general architecture class
Text embeddings and word representations — word and character representation methods

Recurrent Neural Networks¶

Key architectures¶

Applications in misinformation and fake news detection¶

Key papers¶

Related topics¶