H
Howardismvol. 03 · quiet corner of the web
Plate IIArchitectureHOWARDISM

Software 3.0

PublishedMay 23, 2026FiledConceptTopicArchitectureTagsLLM ArchitectureSoftware ParadigmReading5 minSourceAI-synthesised

Karpathy's taxonomy: 1.0 code, 2.0 weights, 3.0 prompting; LLM as programmable interpreter; MenuGen "shouldn't exist"; neural-net-as-host-process extrapolation

Illustration for Software 3.0

Sources#

Summary#

Andrej Karpathy's taxonomy of three programming paradigms: Software 1.0 is explicit code; Software 2.0 is learned weights (you program by curating datasets, objectives, and architectures); Software 3.0 is prompting — the LLM is a programmable computer and "what's in the context window is your lever over the interpreter." Trained on the whole internet, the model implicitly multitasks every task in the data, becoming a general interpreter that performs computation in digital-information space. The deeper claim: 3.0 doesn't just make old programs faster, it makes whole classes of programs unnecessary — and enables information-processing tasks that couldn't exist before.

The three paradigms#

Software 1.0Software 2.0Software 3.0
You writeCodeDatasets + objectives + architecturesPrompts / context
Executed byCPUTrained neural netLLM as interpreter
The "program" isSourceWeightsThe context window

The OpenClaw installer example#

Installing a complex cross-platform tool used to mean a shell script that "balloons up and becomes extremely complex" to target every environment — Software 1.0 thinking. The 3.0 version: the install instructions are a block of text you copy-paste to your agent. The agent packages its own intelligence, inspects your machine, performs intelligent actions, and debugs in the loop. "What is the piece of text to copy-paste to your agent? That's the programming paradigm now." (See Agent-Native Infrastructure for the generalization.)

Karpathy built MenuGen — photograph a menu, OCR the items, generate images of each dish — as a real Vercel app with image-gen plumbing. Then he saw the 3.0 version: hand the photo to Gemini and say "use Nano Banana to overlay the dishes onto the menu," and the model returns the exact menu image with rendered dish pictures in the pixels. "All of my menu gen is spurious. That app shouldn't exist." The neural net subsumes the entire application; the prompt is just the image, the output is just the image, no app in between.

Beyond code: new information-processing tasks#

A subtler point: previous code operated over structured data. Software 3.0 enables operations that were never programs at all. His example is the LLM wiki: "there was no code that would create a knowledge base from a bunch of facts. Now you can take these documents and recompile them in a different way… something new as a reframing of the data." He calls this "more exciting" than mere speedup — not what we can do faster, but what couldn't be done before.

The extrapolation: neural net as host process#

Pushed to the limit: a "completely neural computer" — raw video/audio in, diffusion rendering a UI unique to that moment, the neural net as the host process and CPUs as the co-processor for deterministic appendages. He frames this as the 1950s–60s fork (calculator vs. neural net) flipping back: classical computing won the first round, neural nets are currently virtualized on it, and that relationship may invert. This is The Bitter Lesson taken to its architectural conclusion. He hedges that the path there is "TBD."

Connections#

Open Questions#

  • Where is the line between "the app shouldn't exist" (MenuGen) and apps that should — i.e., when is deterministic 1.0/2.0 scaffolding still the right call vs. spurious?
  • The neural-net-as-host-process flip is presented as plausible-but-TBD. What would the first production system that genuinely inverts the CPU/NN relationship look like?

Sources#

§ end
About this piece

Articles in this journal are synthesised by AI agents from a curated wiki and are refreshed automatically as new concepts arrive. Topics, framing, and editorial direction are curated by Howardism.

Cited by 9
  • Agent-Native Infrastructure

    The world is still built for humans and must be rewritten for agents; "what do I copy-paste to my agent?"; sensors/actu…

  • Andrej Karpathy

    Co-founder OpenAI, ex-Tesla AI, Eureka Labs; coined "vibe coding," Software 1/2/3.0, "ghosts not animals," "agentic eng…

  • Compute Allocator

    The human's evolving role: deciding what's worth spending compute on; ~1% of generated tokens ship, 99% is scaffolding…

  • Disposable Micro-Apps

    Throwaway custom UIs built per-task to edit a plan ("micro-software on top of micro-software"); copy-back-to-markdown;…

  • HTML as the New Markdown

    Thariq Shihipar's thesis: as models improve, thousand-line markdown plans overwhelm the *human*; HTML artifacts (visual…

  • LLM-as-Compiler Knowledge Base

    Karpathy's architecture: LLM incrementally compiles raw docs into a persistent interlinked wiki, replacing RAG with a 4…

  • Thariq Shihipar

    Engineer on the Claude Code team at Anthropic; "HTML is the new markdown" and "compute allocator" framings; three HTML-…

  • The Bitter Lesson

    Sutton 2019: scaled general methods beat hand-engineered structure; recurring justification across the wiki for dissolv…

  • Vibe Coding vs. Agentic Engineering

    Vibe coding raises the floor (anyone builds); agentic engineering preserves the quality bar while going faster; ">10x a…

Related articles
  • Harness Shrinkage as Models Improve

    Prompt scaffolding shrinks each model release; Cat Wu's pruning discipline; Boris Cherny "100 lines of code a year from…

  • Claude Code

    Anthropic's agentic coding product; created by Boris Cherny late 2024; TypeScript/React; CLI/desktop/web/mobile/IDE sur…

  • Andrej Karpathy

    Co-founder OpenAI, ex-Tesla AI, Eureka Labs; coined "vibe coding," Software 1/2/3.0, "ghosts not animals," "agentic eng…

  • Compute Allocator

    The human's evolving role: deciding what's worth spending compute on; ~1% of generated tokens ship, 99% is scaffolding…

  • Outsource Your Thinking, Not Your Understanding

    "You can outsource your thinking but not your understanding"; understanding as the non-delegable human bottleneck; know…