AI Product Building AI Agents Knowledge Systems

Agents learn at three distinct layers — model weights, harness code, and context configuration

Most people jump to model fine-tuning when discussing agent learning, but learning also happens at the harness layer (code, tools, instructions baked into all instances) and the context layer (per-user or per-tenant configuration like CLAUDE.md and skills)

@hwchase17 (Harrison Chase) — Continual Learning for AI Agents · Apr 6, 2026 · 9 connections

Harrison Chase proposes a three-layer model for how AI agents improve over time: the model (weights updated via SFT, RL, GRPO), the harness (the surrounding code, instructions, and tools shared across all instances), and the context (configuration that sits outside the harness and customizes it per user or tenant).

The concrete mapping makes this tangible: for Claude Code, the model is claude-sonnet, the harness is Claude Code itself, and the context is CLAUDE.md, /skills, and mcp.json. The harness/context distinction matters because it determines who benefits — harness improvements affect every user, while context improvements are scoped to whoever owns that configuration.

This frames A mediocre agent inside a strong harness outperforms a stronger agent inside a messy one more precisely: the harness is the layer that powers all instances, so investing there has the highest leverage. Meanwhile, Compound engineering makes each unit of work improve all future work operates primarily at the context layer — each session’s learnings update CLAUDE.md and skills, compounding for that specific user. The model layer is largely outside the control of agent builders, making harness and context the actionable surfaces for improvement. This also reframes Meta-agents that autonomously optimize task agents beat hand-engineered harnesses on production benchmarks — Meta-Harness and AutoAgent are explicitly automating learning at the harness layer. The durability of this investment is reinforced by Agent harnesses are persistent infrastructure, not scaffolding models will absorb — if harnesses were transitional scaffolding that stronger models would absorb, harness-layer learning would be a waste; but the 512k LOC inside Claude Code argues the opposite.

Connected Insights

References (4)

→ Agent harnesses are persistent infrastructure, not scaffolding models will absorb → Compound engineering makes each unit of work improve all future work → A mediocre agent inside a strong harness outperforms a stronger agent inside a messy one → Meta-agents that autonomously optimize task agents beat hand-engineered harnesses on production benchmarks

Referenced by (5)

← Evolved harnesses transfer across models — a single optimized harness improves five different LLMs ← Procedural memory is the highest-impact type of agent memory — it determines what the agent actually does ← Evals are the gradient signal for harness engineering — the same data quality rigor from ML training applies ← A mediocre agent inside a strong harness outperforms a stronger agent inside a messy one ← Agent harnesses are persistent infrastructure, not scaffolding models will absorb