GPT-3 few-shot benchmarks

Question

Rahul Pal · Accepted Answer

## Overview **GPT-3** is a large-scale autoregressive language model that demonstrates remarkable **few-shot learning** capabilities across a wide range of NLP tasks. Brown et al. (2020) established that GPT-3, with 175 billion parameters, achieves strong few-shot performance on many NLP datasets. A critical finding in subsequent research is that these empirical results depend heavily on the choice of in-context examples used to construct the prompt. Liu et al. (2021) ## Key Concepts - **Transformer Architecture** — The foundational network architecture based solely on attention mechanisms, dispensing with recurrence and convolutions, enabling highly parallelizable sequence modeling. Vaswani et al. (2017) - **GPT-3 (175B)** — A 175-billion-parameter autoregressive language model pre-trained via generative pre-training on diverse unlabeled text, achieving strong few-shot performance across NLP benchmarks. Brown et al. (2020) - **GPT (Generative Pre-Training)** — Demonstrates that large gains on NLP tasks can be realized by generative pre-training of a language model on a diverse corpus of unlabeled text. Radford et al. (2018) - **In-Context Example Selection** — The strategy by which few-shot prompts are constructed; retrieval-based selection of semantically similar examples consistently outperforms random sampling for GPT-3. Liu et al. (2021) ## System Architecture ``` ┌─────────────────────────────────────────────────────┐ │ GPT-3 Few-Shot Inference Pipeline │ │ │ │ Test Sam

Aspect	Random Sampling	Retrieval-Based Selection
Example relevance	Low (random)	High (semantically similar)
Benchmark performance	Baseline	Consistently higher
Encoder type	N/A	Task-fine-tuned encoders best

GPT-3 few-shot benchmarks

Overview

Key Concepts

System Architecture

Technical Details / Comparison

Limitations

Key Takeaways

What To Search Next

Research smarter with AI-powered citations