Zero-shot learning¶

Zero-shot learning is the ability to perform a task without any task-specific training examples. In the context of language models, zero-shot approaches leverage knowledge acquired during pretraining to solve new tasks described via natural language instructions. Zero-shot performance avoids the costs and data requirements of task-specific supervision while reducing the risk of spurious feature-label correlations.

Key papers¶

Toxicity Detection with Generative Prompt-based Inference — Demonstrates zero-shot toxicity detection via prompt-based inference without any toxicity-labeled supervision

Prompt-based methods — how zero-shot tasks are specified and performed
Few-Shot Learning — extending to small numbers of examples
Language Models — the models enabling zero-shot generalization
Transfer learning for fake news detection — broader notion of using pretraining for new tasks

Zero-shot learning¶

Key papers¶

Related topics¶