2022 Q1Quarterly Review11 min read

AI & Tech Review ⚡

Q1 2022 rewired AI's competitive logic around two pivots: InstructGPT proved alignment is a capability multiplier (not a constraint), and Chinchilla proved compute-optimal training beats brute-force scaling. Together they collapsed the "bigger model wins" thesis and set the agenda for every training run through 2024. Meanwhile, text-to-image generation reached commercial threshold with Midjourney's closed beta, and OpenAI maintained the only frontier model API on the market.

📌 Navigate

01📋 Exec Summary 02📊 What Moved 03📈 Trend Arcs 04🗺️ Landscape Shift 05💰 Funding & Deal Pattern 06🔍 The Counter-Narrative 07📐 Builder's Benchmark 08👀 What to Watch 09📎 Sources

📋 Exec Summary

📊 What Moved

InstructGPT and the alignment reframe
OpenAI published RLHF-trained InstructGPT (January 27): a 1.3B parameter model that human evaluators consistently preferred over 175B GPT-3 on helpfulness, truthfulness, and harmlessness. Alignment is not a constraint on capability — it is a multiplier.

The Chinchilla scaling law
DeepMind's March paper showed that for every doubling of parameters, you need a doubling of training tokens. Chinchilla (70B, 1.4T tokens) beat Gopher (280B, 300B tokens) at one-quarter the size, rewriting the cost structure of frontier model training toward efficiency over scale.

DALL-E 2 preannouncement conditions
The diffusion research underpinning DALL-E 2 (CLIP, classifier-free guidance, cascaded diffusion) was accumulating through Q1. Developers reading arXiv could see photorealistic text-to-image was weeks away at quarter close.

Open-source model access expands
EleutherAI continued shipping GPT-NeoX and Pythia families, the only open-weight frontier-adjacent models available. The gap to closed frontier was large, but the infrastructure for the open-weight movement was being laid.

Compute costs begin to differentiate
A100 remained the frontier chip (H100 announced late in Q1, but not broadly available yet). Cloud pricing for 10-100B parameter training diverged 30-50% across providers, making infrastructure selection a strategic variable for the first time.

📈 Trend Arcs

Arc 1: Alignment Research Enters the Product Layer

Velocity: Accelerating

January 2022 was the moment alignment research crossed from academic safety concern to product differentiator. Prior to InstructGPT, the dominant assumption was that RLHF and related preference-learning techniques were safety mechanisms — guardrails on dangerous outputs, not tools for making models more useful. The InstructGPT paper inverted this framing by demonstrating that a 1.3B parameter model trained with human feedback could produce outputs that human evaluators preferred over GPT-3 outputs at 175B parameters on open-ended generation, QA, and instruction following. The finding meant alignment was not in tension with capability — it was the mechanism through which capability became legible to users.

Throughout Q1, the research community absorbed this result and began extending it. Papers from Anthropic's early work, DeepMind's work on reward modeling, and independent researchers on the safety-capabilities-overlap expanded the surface area of RLHF as a technique. By March, the major AI labs had either adopted RLHF as a core training component or were actively experimenting with it. The alignment-product convergence that defined 2023's assistant race was locked in during Q1 2022, nine months before the product that made it visible to the public.

Where it stands at quarter close: InstructGPT is deployed on OpenAI's API as the default completion model. Anthropic has incorporated RLHF into its core research program. The technique is no longer experimental — it is the new baseline for fine-tuning frontier language models.

Arc 2: Compute-Optimal Training Replaces Brute-Force Scaling

Velocity: Accelerating (from near zero)

Through 2020 and 2021, the AI field's dominant strategy was the one the Kaplan scaling laws seemed to endorse: train bigger models. The arms race had a clear scoreboard — parameter count — and every major lab competed on it. GPT-3 at 175 billion parameters (2020) was followed by Gopher at 280 billion (December 2021), Megatron-Turing NLG at 530 billion (January 2022), and PaLM at 540 billion (announced April 2022, largely trained through Q1). The assumption embedded in all of this was that more parameters meant better performance and that compute was best spent on model size rather than data volume.

Chinchilla broke the arc in March. The paper was precise: given a fixed compute budget, you should train a model roughly equal in parameter count to the number of training tokens (loosely — the exact formula is budget-dependent). For a model the size of Gopher, compute-optimal training would have used 1.4 trillion tokens instead of 300 billion. The conclusion: Gopher was undertrained by a factor of nearly 5x on data relative to its compute budget. Every model trained under the Kaplan assumptions was similarly miscalibrated.

The velocity shift was structural. Labs could not easily rerun their flagship models under the new regime mid-quarter. But the paper set the agenda for everything trained after March 2022. LLaMA (February 2023) was explicitly Chinchilla-scaled. Mistral's series (2023) was Chinchilla-scaled. The open-weight movement's viability — competitive quality at small model sizes — depended on a scaling law published in Q1 2022.

Where it stands at quarter close: The Chinchilla paper is public and widely circulated in the research community. No production models have yet been retrained under its recommendations. The impact will be felt in Q2–Q4 2022 as new training runs are designed.

Arc 3: Text-to-Image Generation Reaches Commercial Readiness

Velocity: Accelerating

DALL-E (the original, January 2021) demonstrated that text-to-image generation was possible at scale. Throughout 2021, diffusion models — GLIDE, CLIP-guided diffusion, and related work — improved on the GAN-based approaches that had dominated generative image research. By Q1 2022, the capability curves in the research literature were pointing toward commercial-quality generation within quarters, not years.

The key developments through Q1 2022 were less about headline releases and more about the accumulation of techniques: classifier-free guidance improving sample quality dramatically, CLIP embeddings providing the text-image alignment mechanism, and cascaded diffusion architectures enabling high-resolution outputs. DALL-E 2's April release drew directly on this stack — most of the underlying components were public research by March 31, 2022.

The practical signal for builders: if you were reading arXiv through Q1 2022, the arrival of commercial text-to-image generation was not a surprise. It was a predicted outcome from a research pipeline whose components were visible. The builders who got early access to DALL-E 2, Midjourney (launched March 2022 in closed beta), and Stable Diffusion (August 2022) had their first-mover advantage because they were reading primary sources rather than waiting for product launches.

Midjourney quietly launched its closed Discord beta in March 2022. This was the first commercial text-to-image product accessible to non-researchers. It was rough, slow, and hard to use. It also had a waitlist that filled immediately.

Where it stands at quarter close: Midjourney is in closed beta. DALL-E 2 is days away from announcement. Stable Diffusion is in training. The text-to-image arc has not yet crossed into mainstream visibility but is at commercial threshold.

🗺️ Landscape Shift

The competitive map in Q1 2022 was still organized around a simple axis: who had the biggest model. By March 31, the Chinchilla paper had introduced a second axis — compute efficiency — that would reorganize the field over the following 18 months.

Player	Position at quarter open	Position at quarter close	What changed
OpenAI	Dominant via GPT-3 API monopoly and Codex	Dominant, now also with InstructGPT deployed as default; RLHF as production technique	Moved from "most capable" to "most aligned + most capable"
DeepMind	Strong research credibility (Gopher, AlphaCode released Jan 2022)	Strong research credibility + Chinchilla paper shifts field agenda	Authored the scaling law that will constrain every competitor's next training run
Anthropic	Pre-product, publishing Constitutional AI and RLHF research	Pre-product, establishing RLHF credentials through research	Still building; research output is positioning for future product launch
Google Brain / LaMDA	LaMDA in limited demo phase; no API access	Unchanged; FLAN paper (instruction fine-tuning, Oct 2021) continues circulating	Behind on deployment despite strong research base
Meta AI	Pre-LLaMA; OPT model in development	OPT paper published May 2022; LLaMA in future training	Not yet a factor in deployment; groundwork being laid
EleutherAI	Only source of open frontier-adjacent weights	Same position, larger community	Open-weight ecosystem growing but still niche
Stability AI	Not yet publicly known	Not yet publicly known	Stable Diffusion in development, announcement months away
Midjourney	Not public	Closed Discord beta launched March 2022	First commercial text-to-image product to reach users

The structural shift: OpenAI expanded its moat not by releasing a bigger model but by releasing a better-aligned smaller model. DeepMind raised its research credibility without a product deployment. Anthropic remained in the research phase. The absence of a strong third commercial API competitor to OpenAI was the defining structural fact of Q1 2022 — a gap that would not close until mid-2023.

💰 Funding & Deal Pattern

Q1 2022 AI investment remained elevated but showed early signs of the correction that would intensify through 2022. The 2021 SPAC-fueled environment had begun cooling in public markets by Q4 2021; that cooling reached private venture with a lag.

Concentration at Series B and C
Early-stage check sizes contracting as lead investors recalibrated. Later-stage rounds holding up for companies with demonstrable revenue, enterprise contracts, or regulatory clearance.

Infrastructure over applications
Dominant capital flow toward compute, MLOps, and data infrastructure. Runway's $50M Series C (February 2022) was a rare application-layer exception.

Strategic corporates as price setters
Exscientia-Sanofi set the ceiling on drug-discovery AI valuation. Microsoft's deepening OpenAI relationship was the dominant general AI strategic capital story — the only major cloud provider with a live frontier model bet.

The consensus break
The 2020-2021 thesis that application layer captures most AI value came under pressure. InstructGPT showed alignment was a durable differentiator, not easily replicated by fine-tuning a commodity base model. Investors began repositioning toward foundation model providers.

🔍 The Counter-Narrative

The consensus: Bigger models win; parameter count is the scoreboard. The reality: Three consecutive quarters of evidence (FLAN, InstructGPT, Chinchilla) showed smaller, better-trained, better-aligned models beat bigger raw models. The builders who internalized this were reading primary sources while everyone else read press releases.
The consensus: Open-source AI is years behind frontier. The reality: The capability gap was real, but the enabling conditions for parity were already in motion — EleutherAI proved open training at scale was feasible, Chinchilla made it economically viable, and Meta's OPT and Stable Diffusion were in development. The gap's closure was scheduled, not speculative.

📐 Builder's Benchmark

API pricing (Q1 2022 reference points):

OpenAI GPT-3 (Davinci): $0.02 per 1K tokens (approximately $20 per million tokens)
OpenAI Codex: Free during beta (launched August 2021, still in beta access)
No other frontier model APIs available commercially
Token pricing context: a typical enterprise use case (document summarization, classification) at scale would cost $2,000–$10,000 per month at Davinci pricing; the cost made many applications economically marginal

Performance benchmarks that shifted this quarter:

InstructGPT 1.3B outperforms GPT-3 175B on human preference evaluation for instruction-following tasks: a 130x parameter reduction with positive preference outcome
AlphaCode (DeepMind, released January 2022): reaches roughly 50th percentile on competitive programming benchmarks — first model to demonstrate competitive-level code generation
Chinchilla (70B) outperforms Gopher (280B) on MMLU, BIG-bench, and the majority of NLP benchmarks evaluated; parameter efficiency as a benchmark axis introduced formally

Adoption curves:

OpenAI API developer accounts: estimated 1M+ by Q1 2022 close (not publicly disclosed; inferred from product blog references)
GitHub Copilot: still in technical preview; not yet generally available (GA launched June 2022)
Midjourney Discord: closed beta launched March 2022; waitlist filled within days

Open-source vs. closed gap:

Capability gap: large. Best open weights (EleutherAI GPT-NeoX-20B) vs. GPT-3.5 equivalent: significant quality difference on most tasks
Infrastructure gap: narrowing. Open training tooling (Megatron-DeepSpeed, accelerate library) improving rapidly
Economic gap: closing. Chinchilla paper published March 2022 — open-weight teams now have the scaling law that makes smaller, cheaper training competitive

👀 What to Watch

DALL-E 2 announcement (April 6, 2022)
the signal is the access structure (waitlist, API, or open release), not the capability demo; access choice determines how quickly the application ecosystem forms.

GitHub Copilot GA (expected Q2 2022)
Microsoft/GitHub moving Copilot from technical preview to paid product; pricing and enterprise tier structure set the benchmark for AI-assisted development pricing.

Chinchilla replication (ongoing)
watch arXiv for training compute papers referencing the Chinchilla ratio; independent validation from labs outside DeepMind determines whether the scaling law becomes field canon.

PaLM release (expected Q2 2022)
Google Brain's 540B parameter Pathways Language Model; benchmark performance relative to Chinchilla-optimal baselines is the first public test of whether large labs are adjusting training regimens.

Anthropic product timeline (ongoing)
research output (Constitutional AI, preference model work) is consistent with a team preparing deployment; watch for hiring signals, partnership announcements, or early access invitations.

📎 Sources

Key references for this quarter. Links provided where available; historical entries may reference publications by title and date.

Source	Reference	Link
OpenAI	"Training language models to follow instructions with human feedback" (InstructGPT), January 2022	https://arxiv.org/abs/2203.02155
DeepMind	"Training Compute-Optimal Large Language Models" (Chinchilla), March 2022	https://arxiv.org/abs/2203.15556
DeepMind	AlphaCode — competitive programming AI, January 2022	https://deepmind.google/discover/blog/competitive-programming-with-alphacode/
EleutherAI	GPT-NeoX and Pythia model families, ongoing Q1 2022	https://github.com/EleutherAI/gpt-neox
Wei et al.	"Finetuned Language Models Are Zero-Shot Learners" (FLAN), October 2021	https://arxiv.org/abs/2109.01652
Midjourney	Closed Discord beta launched March 2022	https://www.midjourney.com/
OpenAI	DALL-E 2 preannouncement conditions, Q1 2022 research pipeline	https://openai.com/dall-e-2
NVIDIA	GTC March 2022 — H100 GPU announcement	https://www.nvidia.com/gtc/
Kaplan et al.	"Scaling Laws for Neural Language Models," January 2020	https://arxiv.org/abs/2001.08361

2022 Q1Quarterly Review11 min read

AI & Tech Review ⚡

📌 Navigate

📋 Exec Summary

📊 What Moved

📈 Trend Arcs

Arc 1: Alignment Research Enters the Product Layer

Velocity: Accelerating

Arc 2: Compute-Optimal Training Replaces Brute-Force Scaling

Velocity: Accelerating (from near zero)

Arc 3: Text-to-Image Generation Reaches Commercial Readiness

Velocity: Accelerating

🗺️ Landscape Shift

Player	Position at quarter open	Position at quarter close	What changed
OpenAI	Dominant via GPT-3 API monopoly and Codex	Dominant, now also with InstructGPT deployed as default; RLHF as production technique	Moved from "most capable" to "most aligned + most capable"
DeepMind	Strong research credibility (Gopher, AlphaCode released Jan 2022)	Strong research credibility + Chinchilla paper shifts field agenda	Authored the scaling law that will constrain every competitor's next training run
Anthropic	Pre-product, publishing Constitutional AI and RLHF research	Pre-product, establishing RLHF credentials through research	Still building; research output is positioning for future product launch
Google Brain / LaMDA	LaMDA in limited demo phase; no API access	Unchanged; FLAN paper (instruction fine-tuning, Oct 2021) continues circulating	Behind on deployment despite strong research base
Meta AI	Pre-LLaMA; OPT model in development	OPT paper published May 2022; LLaMA in future training	Not yet a factor in deployment; groundwork being laid
EleutherAI	Only source of open frontier-adjacent weights	Same position, larger community	Open-weight ecosystem growing but still niche
Stability AI	Not yet publicly known	Not yet publicly known	Stable Diffusion in development, announcement months away
Midjourney	Not public	Closed Discord beta launched March 2022	First commercial text-to-image product to reach users

💰 Funding & Deal Pattern

Infrastructure over applications
Dominant capital flow toward compute, MLOps, and data infrastructure. Runway's $50M Series C (February 2022) was a rare application-layer exception.

🔍 The Counter-Narrative

The consensus: Bigger models win; parameter count is the scoreboard. The reality: Three consecutive quarters of evidence (FLAN, InstructGPT, Chinchilla) showed smaller, better-trained, better-aligned models beat bigger raw models. The builders who internalized this were reading primary sources while everyone else read press releases.
The consensus: Open-source AI is years behind frontier. The reality: The capability gap was real, but the enabling conditions for parity were already in motion — EleutherAI proved open training at scale was feasible, Chinchilla made it economically viable, and Meta's OPT and Stable Diffusion were in development. The gap's closure was scheduled, not speculative.

📐 Builder's Benchmark

API pricing (Q1 2022 reference points):

OpenAI GPT-3 (Davinci): $0.02 per 1K tokens (approximately $20 per million tokens)
OpenAI Codex: Free during beta (launched August 2021, still in beta access)
No other frontier model APIs available commercially
Token pricing context: a typical enterprise use case (document summarization, classification) at scale would cost $2,000–$10,000 per month at Davinci pricing; the cost made many applications economically marginal

Performance benchmarks that shifted this quarter:

InstructGPT 1.3B outperforms GPT-3 175B on human preference evaluation for instruction-following tasks: a 130x parameter reduction with positive preference outcome
AlphaCode (DeepMind, released January 2022): reaches roughly 50th percentile on competitive programming benchmarks — first model to demonstrate competitive-level code generation
Chinchilla (70B) outperforms Gopher (280B) on MMLU, BIG-bench, and the majority of NLP benchmarks evaluated; parameter efficiency as a benchmark axis introduced formally

Adoption curves:

OpenAI API developer accounts: estimated 1M+ by Q1 2022 close (not publicly disclosed; inferred from product blog references)
GitHub Copilot: still in technical preview; not yet generally available (GA launched June 2022)
Midjourney Discord: closed beta launched March 2022; waitlist filled within days

Open-source vs. closed gap:

Capability gap: large. Best open weights (EleutherAI GPT-NeoX-20B) vs. GPT-3.5 equivalent: significant quality difference on most tasks
Infrastructure gap: narrowing. Open training tooling (Megatron-DeepSpeed, accelerate library) improving rapidly
Economic gap: closing. Chinchilla paper published March 2022 — open-weight teams now have the scaling law that makes smaller, cheaper training competitive

👀 What to Watch

📎 Sources

Key references for this quarter. Links provided where available; historical entries may reference publications by title and date.

Source	Reference	Link
OpenAI	"Training language models to follow instructions with human feedback" (InstructGPT), January 2022	https://arxiv.org/abs/2203.02155
DeepMind	"Training Compute-Optimal Large Language Models" (Chinchilla), March 2022	https://arxiv.org/abs/2203.15556
DeepMind	AlphaCode — competitive programming AI, January 2022	https://deepmind.google/discover/blog/competitive-programming-with-alphacode/
EleutherAI	GPT-NeoX and Pythia model families, ongoing Q1 2022	https://github.com/EleutherAI/gpt-neox
Wei et al.	"Finetuned Language Models Are Zero-Shot Learners" (FLAN), October 2021	https://arxiv.org/abs/2109.01652
Midjourney	Closed Discord beta launched March 2022	https://www.midjourney.com/
OpenAI	DALL-E 2 preannouncement conditions, Q1 2022 research pipeline	https://openai.com/dall-e-2
NVIDIA	GTC March 2022 — H100 GPU announcement	https://www.nvidia.com/gtc/
Kaplan et al.	"Scaling Laws for Neural Language Models," January 2020	https://arxiv.org/abs/2001.08361

📌 Navigate

📋 Exec Summary

📊 What Moved

📈 Trend Arcs

Arc 1: Alignment Research Enters the Product Layer

Arc 2: Compute-Optimal Training Replaces Brute-Force Scaling

Arc 3: Text-to-Image Generation Reaches Commercial Readiness

🗺️ Landscape Shift

💰 Funding & Deal Pattern

🔍 The Counter-Narrative

📐 Builder's Benchmark

👀 What to Watch

📎 Sources

More AI & Tech

📌 Navigate

📋 Exec Summary

📊 What Moved

📈 Trend Arcs

Arc 1: Alignment Research Enters the Product Layer

Arc 2: Compute-Optimal Training Replaces Brute-Force Scaling

Arc 3: Text-to-Image Generation Reaches Commercial Readiness

🗺️ Landscape Shift

💰 Funding & Deal Pattern

🔍 The Counter-Narrative

📐 Builder's Benchmark

👀 What to Watch

📎 Sources

More AI & Tech