Howardism · Vol. 03 · Plate II · No. 02

Architecture, in order.

Notes 21 · Topic Architecture · Oldest 10 Apr 2026 · Newest 23 May 2026

Model internals: encoder-free fusion, training, scaling, evals.

Architecture articles, sorted by date, newest first.
Title	Summary	Date
The AI-Native Safe-Choice Inversion	Buying the legacy incumbent used to be "safe"; post-AI, being the incumbent = not AI-native; boards give buyers air cover; a counter-positioning play	23 May 2026
Building Is Cheap, Arguing Is Expensive	"In technical debate, code wins": generate three PRs vs whiteboard; prototype over design doc; reduce design docs	23 May 2026
Code as Source of Truth	Docs go stale at high coding throughput; check specs/skills into the repo; onboard via Claude; spec-drift verification	23 May 2026
Founder-Led Sales Discipline	Stay founder-led until PMF; don't offload sales to an AE or an agent; explicit tension with Founder As Agent Orchestrator	23 May 2026
Jagged Intelligence (Ghosts, Not Animals)	"Ghosts not animals": jagged statistical circuits, no intrinsic motivation; car-wash/strawberry failures; stay in the loop, treat as tools	23 May 2026
Narrow Wedge into a Legacy Market	Disrupt without being feature-complete: be the best for a narrow customer profile (tech cos outgrowing QuickBooks); Google-Sheets MVP; the wedge-flip lesson	23 May 2026
Outsource Your Thinking, Not Your Understanding	"You can outsource your thinking but not your understanding"; understanding as the non-delegable human bottleneck; knowledge bases as understanding-tools	23 May 2026
Product Velocity as Moat	Shipping speed as differentiator + trust signal ("you'll scale with us"); a treadmill that must convert into durable lock-in	23 May 2026
Software 3.0	Karpathy's taxonomy: 1.0 code, 2.0 weights, 3.0 prompting; LLM as programmable interpreter; MenuGen "shouldn't exist"; neural-net-as-host-process extrapolation	23 May 2026
The Verifiability Thesis	LLMs automate what you can verify as computers automate what you can specify; RL verification rewards → jagged peaks; "verifiable + labs care"; everything eventually verifiable	23 May 2026
Problem-Solution Fit Discipline	Idea-stage thesis: three defenses against premature building (time, resources, belief friction) all eroded; AI as devil's advocate is the antidote to confirmation-bias-with-research-engine	18 May 2026
The Bitter Lesson	Sutton 2019: scaled general methods beat hand-engineered structure; recurring justification across the wiki for dissolving harnesses into models; caveat — mechanical verification and character may not migrate inward	13 May 2026
Time-Aligned Micro-Turns	The core interaction-model move: input/output as continuous streams in ~200ms interleaved chunks, no turn boundaries; streaming-sessions inference (upstreamed to SGLang), latency-tuned MoE kernels, bitwise trainer-sampler alignment	13 May 2026
TML-Interaction-Small	TML's first interaction model: 276B MoE / 12B active, audio+video+text in / text+audio out, 200ms micro-turns, async background agent; best turn-taking latency of any model; research preview May 2026	13 May 2026
Model Spec Midtraining (MSM)	New training phase between pretrain and AFT: train base model on synthetic docs discussing the Model Spec; controls AFT generalization; cuts agentic misalignment 54%→7%; beats deliberative alignment baseline	8 May 2026
Model Spec Science	Empirical study of which Model Spec features best generalize alignment; value explanations > rules alone, specific > general "be ethical" framing; first concrete examples in Li et al. 2026	8 May 2026
Opus 4.6 → 4.7 Changes and Multi-Agent Coding Considerations	4.6→4.7 delta table + six hazards for multi-agent coding teams: role-based model selection, prompt re-tuning, harness invariants, per-agent context budget, unattended-fan-out safety, independent reviewer	17 Apr 2026
Scale-Dependent Prompt Sensitivity	Large models underperform small ones on 7.7% of standard benchmarks due to overthinking; brevity constraints recover 26pp and fully reverse hierarchy on GSM8K/MMLU-STEM	14 Apr 2026
When to Use Claude Opus 4.6 for Work	Decision rules for Opus 4.6 deployment: solver-not-planner, elaboration-load-bearing tasks, brevity constraints, Pareto frontier check	14 Apr 2026
LLM-as-Compiler Knowledge Base	Karpathy's architecture: LLM incrementally compiles raw docs into a persistent interlinked wiki, replacing RAG with a 4-phase ingest→compile→query→lint pipeline	10 Apr 2026
What Are AI Tools?	Overview of AI tools landscape and categories	10 Apr 2026