Howardism · vol. 03 · quiet corner of the web
Vol. 03 · Journal of quiet software

Built in public, read in quiet.

Curator: Howardism · Based: Taiwan, 23°N · Writing: Since 2022 · Frequency: Monthly, ish
01
Plate I · Surface
Concept · 2 min read

Encoder-Free Early Fusion

Multimodal design that uses minimal pre-processing instead of large standalone encoders: dMel audio embeddings, a 40×40-patch hMLP for frames, and a flow head for audio output, all co-trained from scratch in one transformer

Concept · 3 min read

Full-Duplex Interaction

Perceiving and responding simultaneously across modalities; proactive interjection, visual-cue reactions, simultaneous speech, live translation and commentary, and time-aware speech are all special cases of model behavior

Concept · 3 min read

Interaction / Background Model Split

Dual-model architecture: time-aware interaction model stays present; async background model handles deep reasoning/tools; rich-context-package delegation; "reasoning-model planning at non-thinking latency"

Concept · 6 min read

Interaction Models

Thinking Machines Lab (May 2026): models that handle audio, video, and text interaction natively in real time rather than through a harness; interactivity scales with intelligence only if it lives in the model

Work
  1. Oddle
  2. Lootex
  3. FST Network
  4. ROCAF