Skip to content

Language Models

Language models are statistical or neural models that assign probabilities to sequences of tokens (words, characters, or subword units). They are trained to predict the next token given a sequence of preceding tokens, typically using maximum likelihood estimation.

Key papers