Skip to content
H
Howardismvol. 03 · quiet corner of the web
PLATE II · PIECE № 35HOWARDISM

Design Concept Grilling

PublishedMay 6, 2026FiledConceptReading5 minSourceAI-synthesised

Matt Pocock's `grill-me` skill; reach Brooks "design concept" before any plan; counter to specs-to-code; PRD as destination doc, Kanban as journey doc

Illustration for Design Concept Grilling

Sources#

Summary#

Matt Pocock's grill-me skill — a relentless interviewer prompt that walks down decision-tree branches one question at a time, with recommended answers — replaces "ask the agent for a plan" with "reach a shared understanding before any plan exists." The point is alignment, not output. The goal-state is what Frederick P. Brooks calls the design concept in The Design of Design: a shared idea held by all participants in the work. A PRD or a plan is downstream of the design concept; producing one without alignment first guarantees rework.

The skill (verbatim)#

"Interview me relentlessly about every aspect of this plan until we reach a shared understanding. Walk down each branch of the decision tree, resolving dependencies one by one. For each question provide your recommended answer. Ask the questions one at a time…"

That's it. The skill is short on purpose — minimum surface area, maximum behavior change.

Why grill before plan#

Pocock observed that agents in plan mode "really eagerly try to produce a plan" — say "I think I've got enough" and ship a plan that papers over open questions. The plan reads fine, but it is wrong in ways that don't surface until implementation. Forcing the agent to interview first exposes the open questions while there is still time to answer them cheaply.

The recommendation-with-each-question pattern is load-bearing: it lets the user say "yes, agreed" most of the time, only debating where they actually disagree. A pure question-only interview wastes user attention on obvious calls.

Counter to the "specs to code" movement#

Pocock's strongest negative thesis: specs-to-code is vibe coding by another name. Defenders say "write a careful spec, hand to AI, fix the spec when the code is wrong, never look at the code." Pocock has tried it: it doesn't work.

Reasons:

  • The code is the battleground, not the spec
  • Specs that don't engage with code degrade into wish-lists
  • The feedback loop runs through a layer (spec ⇄ AI ⇄ code) instead of where the bugs actually live (code ⇄ tests)
  • Without code engagement, the developer's mental model of the system rots

Grilling sits on the opposite discipline: spec is downstream of alignment, alignment is upstream of any artifact, the developer keeps a hand in the code throughout.

Outputs of a grilling session#

A grilling session can run anywhere from 10 to 100 questions; Pocock has had sessions that went an hour. The artifact at the end is the conversation history itself — kept around as raw material for the PRD step. Pocock's write-a-PRD skill consumes this history (along with another short interview) to produce a destination document.

He explicitly does not review the PRD afterwards:

"What am I testing at this point? What are the failure modes I'm trying to test for? I know that LLMs are great at summarization. I have reached the same wavelength as the LLM. So all I'm doing is checking the LLM's ability to summarize."

This is only safe because the grilling session did the alignment work. Skip grilling and you must read the PRD.

Two essential documents#

After grilling, Pocock generates exactly two documents:

  1. PRD (destination doc) — what the finished thing looks like, user stories, definition of done, out-of-scope list, implementation decisions, testing decisions, modules to be modified
  2. Kanban (journey doc) — vertical slices into independently grabbable tickets (see Vertical Slice Tracer Bullets)

He then deletes (or closes) the PRD after implementation completes — see doc rot.

Module map appears in the PRD#

The PRD includes "modules to be modified" — concrete identification of which existing modules change and which new ones are introduced. This connects planning to architecture (see Deep Modules for Agents). The point is to keep the codebase shape in mind throughout planning, not as an afterthought during implementation.

When to skip grilling#

Grilling is for human-in-the-loop tasks. For a short well-scoped change ("rename this function across the codebase"), the overhead is wasted. The discipline scales with stakes: bigger feature, fuzzier brief, higher cost of going the wrong way → grill harder.

Connections#

Open questions#

  • Can grilling be run AFK against another agent that holds the user's preferences? Pocock's answer in 2026 is "no, this part has to be human-in-the-loop" — but the question is open as agents get better at modeling their principal.
  • How does grilling change for team work where multiple humans need to align? Pocock's hint: pair-program with the agent in the room, treat it as a third interlocutor.

Sources#

§ end
About this piece

Articles in this journal are synthesised by AI agents from a curated wiki and are refreshed automatically as new concepts arrive. Topics, framing, and editorial direction are curated by Howardism.

14 articles link here
  • ConceptAgent Harness Engineering

    Patterns for scaffolding long-running LLM agents: environment design, progressive context disclosure, mechanical archit…

  • ConceptAgent Loop Pattern

    `/loop` (cron-scheduled) and Ralph Wiggum (backlog-draining) loops as next-generation agent primitive; AFK execution, p…

  • EssayOpinions on Using AI Tools & the Future of the Software Engineering Role

    Debate map of four stances on using AI tools (bullish-insider / pragmatist-practitioner / skeptic-governance / architec…

  • ConceptClaude Code Best Practices

    Anthropic's guide to effective Claude Code usage: context management, verification-driven development, explore→plan→cod…

  • ConceptContext Window Smart Zone

    Smart zone vs dumb zone (Dex Hardy / Matt Pocock): quadratic attention scaling, ~100K marker independent of advertised…

  • ConceptDeep Modules for Agents

    Ousterhout deep-vs-shallow modules applied to agent-friendly codebases; push-vs-pull instruction delivery; reviewer in…

  • ConceptInteraction Models

    Thinking Machines Lab (May 2026): models that handle audio/video/text interaction natively in real time instead of via…

  • EssayLearning to Co-Work with AI: A Software Engineer's Field Guide

    Field guide for software engineers in the AI era: 6 skill clusters (taste, harness, alignment-first planning, agent-fri…

  • ConceptLLM-as-Compiler Knowledge Base

    Karpathy's architecture: LLM incrementally compiles raw docs into a persistent interlinked wiki, replacing RAG with a 4…

  • EntityMatt Pocock

    Independent AI-coding educator; built Sandcastle library; smart-zone/grill-me/tracer-bullets pedagogical framing; "bad…

  • ConceptModel Introspection Feedback

    Cat Wu's underrated technique: ask the model why it failed; treat answer as harness-debugging signal not model criticis…

  • ConceptModel Spec Midtraining (MSM)

    New training phase between pretrain and AFT: train base model on synthetic docs discussing the Model Spec; controls AFT…

  • ConceptTurn-Based Interface Bottleneck

    Why current AI interfaces limit collaboration: single-thread turn-taking is a bandwidth bottleneck; humans pushed out b…

  • ConceptVertical Slice Tracer Bullets

    Pragmatic-Programmer tracer-bullet pattern applied to agent task decomposition; vertical slices > horizontal layers; Ka…

Related articles
  • ConceptAgent Harness Engineering

    Patterns for scaffolding long-running LLM agents: environment design, progressive context disclosure, mechanical archit…

  • ConceptContext Window Smart Zone

    Smart zone vs dumb zone (Dex Hardy / Matt Pocock): quadratic attention scaling, ~100K marker independent of advertised…

  • ConceptHarness Shrinkage as Models Improve

    Prompt scaffolding shrinks each model release; Cat Wu's pruning discipline; Boris Cherny "100 lines of code a year from…

  • ConceptDeep Modules for Agents

    Ousterhout deep-vs-shallow modules applied to agent-friendly codebases; push-vs-pull instruction delivery; reviewer in…

  • ConceptAgent Loop Pattern

    `/loop` (cron-scheduled) and Ralph Wiggum (backlog-draining) loops as next-generation agent primitive; AFK execution, p…