Large Language Models

Large language models (LLMs) are transformer-based neural language models trained on massive text corpora that achieve state-of-the-art performance on a diverse array of downstream NLP tasks. Empirical scaling laws suggest that performance improves smoothly and predictably as model size, dataset size, and training compute increase, while certain emergent capabilities appear only beyond particular scale thresholds.
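One common way to make the "smooth improvement with scale" claim concrete is to model loss as a power law in parameter count. The sketch below is illustrative only: the function name `power_law_loss` and the constants `n_c` and `alpha` are hypothetical placeholders, not fitted values from any paper.

```python
import math


def power_law_loss(n_params: float, n_c: float = 1e13, alpha: float = 0.1) -> float:
    """Illustrative scaling curve L(N) = (n_c / N) ** alpha.

    n_c and alpha are placeholder constants chosen for readability,
    not values fitted to real training runs.
    """
    if n_params <= 0:
        raise ValueError("n_params must be positive")
    return (n_c / n_params) ** alpha


if __name__ == "__main__":
    # Each 10x increase in parameters multiplies the loss by the same factor,
    # 10 ** -alpha, which is what "smooth" scaling means under this model.
    for n in (1e8, 1e9, 1e10, 1e11):
        print(f"N = {n:.0e}  modeled loss = {power_law_loss(n):.3f}")
```

Under this assumed form the curve is a straight line on a log-log plot, so it cannot by itself represent abrupt capability thresholds; those are usually observed on downstream task metrics rather than on the loss.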

Key papers