Skip to content

Sequence Modeling

Sequence modeling refers to the task of learning statistical patterns in sequential data (text, audio, time series, etc.). Neural approaches to sequence modeling have evolved from recurrent architectures (RNNs, LSTMs) to attention-based models (Transformers).

Foundational architectures

  • Attention Is All You Need — introduces the Transformer, replacing recurrence with pure self-attention for sequence transduction.