Sources#
Summary#
Andrej Karpathy's answer to "what's still worth learning deeply when intelligence gets cheap" rests on a tweet he keeps returning to: "you can outsource your thinking, but you can't outsource your understanding." Information still has to make it into your brain. The human becomes the bottleneck on knowing what to build, why it's worth doing, and how to direct the agents. You can't be a good director without understanding, because the LLMs themselves "don't excel at understanding." Understanding is therefore the residual, non-delegable human capacity in an agentic world.
Thinking vs. understanding#
- Thinking = the processing, the generation of intermediate steps, the search. Delegable to agents.
- Understanding = the internal model that lets you direct, judge, and verify. Not delegable — "you still are uniquely in charge of that."
The human is "becoming a bottleneck of even knowing what we're trying to build, why is it worth doing, and how do I direct my agents." This is the same residual the rest of the corpus keeps naming from different angles: taste/spec/oversight in Vibe Coding vs. Agentic Engineering, the Compute Allocator role in HTML as the New Markdown, product taste in Engineer PM Convergence.
Knowledge bases as understanding-tools#
Karpathy ties the thesis directly to the LLM-wiki pattern he originated: building a personal wiki from the articles he reads, then "asking questions about things." His mechanism: "anytime I see a different projection onto information, I gain insight." He treats wiki-building as synthetic data generation over fixed data, a way to force information into his head. Here the originator of the pattern explains why it works: the value lies not in retrieval but in the re-compilation that produces understanding. (This vault is a literal instance.)
The "you must understand to direct" loop#
The argument is circular in a load-bearing way: agents do the thinking → but agents can't supply understanding → so the human must understand to direct the thinking → so tools that build human understanding (knowledge bases, good projections of information) become the highest-leverage investment. The bottleneck isn't compute or model quality; it's how fast a human can genuinely understand. He ends hoping to return "in a couple years to see if they've automated understanding too" — flagging it as the open frontier.
The simplicity tell#
His nanoGPT-simplification anecdote doubles as an understanding example: he understands what minimal, clean LLM-training code should look like; the model can't produce it ("they hate this, they can't do it"). His understanding exceeds the model's in a domain outside its RL circuits — exactly where the human's non-outsourceable understanding earns its keep.
Connections#
- Andrej Karpathy — the education thesis; his closing answer
- LLM-as-Compiler Knowledge Base — the tool he uses to build understanding; "different projection → insight" is the why behind this whole vault
- Vibe Coding vs. Agentic Engineering — taste/spec/oversight is understanding applied to direction
- Compute Allocator — Thariq Shihipar's framing of the residual human role; deciding what's worth compute requires understanding
- Jagged Intelligence (Ghosts, Not Animals) — the nanoGPT-simplification case: human understanding exceeds the ghost where it's out-of-distribution
- Engineer PM Convergence — product taste as the durable human skill is understanding in the product register
- AI-Driven Formal Proof Search — DeepMind found formal proof sketches deepened mathematicians' understanding even when unproven: AI as an understanding tool, exactly this thesis
- Design Concept Grilling — reaching the Brooks "design concept" before planning is forcing understanding to precede thinking
- AI Brain Fry — the failure mode when oversight outpaces understanding: rubber-stamping without comprehending
Open Questions#
- Karpathy's open frontier: can "understanding" itself eventually be automated, or is it definitionally the human residue? His "back in a couple years" hedge leaves it open.
- If understanding is the bottleneck, is the highest-ROI skill learning how to build understanding fast (knowledge-base hygiene, asking for the right projections), and can that be taught?
