A mediocre agent inside a strong harness outperforms a stronger agent inside a messy one
The surrounding machinery — metrics, rollback, scoping, observability — determines autonomous system performance more than model capability
Manthan Gupta (@manthanguptaa) — How Karpathy's Autoresearch Works And What You Can Learn From It · · 25 connections
Connected Insights
References (8)
→ Agents learn at three distinct layers — model weights, harness code, and context configuration → Autonomous coding loops need small stories and fast feedback to work → Compound engineering makes each unit of work improve all future work → Declarative beats imperative when working with agents → Intelligence location — code vs prompts — determines system fragility and flexibility → Meta-agents that autonomously optimize task agents beat hand-engineered harnesses on production benchmarks → Open harnesses with customer-owned databases are the antidote to model-provider lock-in → Verification is the single highest-leverage practice for agent-assisted coding
Referenced by (17)
← Research speed is mostly the speed at which you discover you're wrong — which makes tooling a first-class research activity ← Engineering is no longer the junior partner — at the frontier, research and engineering have fused ← Verification is the single highest-leverage practice for agent-assisted coding ← Agents learn at three distinct layers — model weights, harness code, and context configuration ← Stronger models expand the verification gap, not close it ← Evolved harnesses transfer across models — a single optimized harness improves five different LLMs ← Every optimization has a shadow regression — guard commands make the shadow visible ← Evals are the gradient signal for harness engineering — the same data quality rigor from ML training applies ← Rollback safety nets enable autonomous iteration — not model intelligence ← Harness engineering — humans steer, agents execute, documentation is the system of record ← Time-bounded evaluation forces optimization for real-world usefulness instead of idealized performance ← Detect everything, notify selectively — the observability-to-notification ratio determines system trust ← Meta-agents that autonomously optimize task agents beat hand-engineered harnesses on production benchmarks ← Self-improving agents overfit to eval metrics — the meta-agent games rubrics unless structurally constrained ← Intelligence location — code vs prompts — determines system fragility and flexibility ← Agent harnesses are persistent infrastructure, not scaffolding models will absorb ← Open harnesses with customer-owned databases are the antidote to model-provider lock-in