COUPLING metrics — day/week/month tracking of Opus-orchestrates-Sonnet-subagent ratio on claude.gf.cx
DARE.CO.UK · PARKED SKETCH · 2026-06-11
Mirrored from ~/.claude/.../memory/parked_sketch_coupling_metrics_day_week_month_2026-06-06.md. This is a design sketch parked for future build — read for context, not as a current deliverable.
As the model-tier split pattern (Opus reviews, Sonnet subagents do the heavy lifting) takes hold across happiness.gf.cx voicing and future editorial work, the visible signal for “is the operating model actually saving Opus tokens?” is a COUPLING ratio: the Sonnet-subagent share of total work, tracked over rolling 24h / 7d / 30d windows. Today’s pilot (Ch1 + Ch2 of Vitality Blueprint) added ~62k Sonnet tokens; they are already counted in the 72% weekly envelope but are not visible as a Sonnet contribution. The metric makes the coupling visible and gives Dan a forward indicator of efficiency.
Sketch: add a COUPLING metric to claude_usage_dashboard.py (and surface it on claude.gf.cx and the status.gf.cx Claude cockpit card): the share of work delivered via Sonnet subagents versus by Opus directly, tracked over rolling 24h / 7d / 30d windows.
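A minimal sketch of how the rolling-window ratio could be computed, assuming token counts are the proxy for "work" and that usage is available as timestamped per-model records. The `UsageEvent` shape, the `"opus"` / `"sonnet"` labels, and both function names are hypothetical; nothing here reflects the actual contents of claude_usage_dashboard.py. The 62k Sonnet figure comes from the pilot; the Opus figures are made-up illustration values.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta
from typing import Iterable, Optional


# Hypothetical record shape; the real dashboard's log schema may differ.
@dataclass(frozen=True)
class UsageEvent:
    timestamp: datetime
    model: str   # assumed labels: "opus" or "sonnet"
    tokens: int


# The three rolling windows named in the sketch.
WINDOWS = {
    "24h": timedelta(hours=24),
    "7d": timedelta(days=7),
    "30d": timedelta(days=30),
}


def coupling_ratio(events: Iterable[UsageEvent], now: datetime,
                   window: timedelta) -> Optional[float]:
    """Sonnet share of total tokens in [now - window, now]; None if no usage."""
    cutoff = now - window
    recent = [e for e in events if cutoff <= e.timestamp <= now]
    total = sum(e.tokens for e in recent)
    if total == 0:
        return None  # avoid dividing by zero on an empty window
    sonnet = sum(e.tokens for e in recent if e.model == "sonnet")
    return sonnet / total


def coupling_report(events: Iterable[UsageEvent], now: datetime) -> dict:
    """Coupling ratio for each rolling window, keyed by window label."""
    events = list(events)  # allow generators to be reused across windows
    return {label: coupling_ratio(events, now, w) for label, w in WINDOWS.items()}


# Illustrative data: 62k Sonnet tokens from the pilot, invented Opus usage.
now = datetime(2026, 6, 6, 18, 0)
sample = [
    UsageEvent(now - timedelta(hours=2), "sonnet", 62_000),
    UsageEvent(now - timedelta(hours=3), "opus", 38_000),
    UsageEvent(now - timedelta(days=3), "opus", 100_000),
]
print(coupling_report(sample, now))
```

Returning `None` for an empty window keeps "no activity" distinct from "0% coupling", which the dashboard card would probably want to render differently.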
Why (Dan 2026-06-06):
“we should track COUPLING metrics day-week-month”
“looking forward to catching some sonnet-percentage from COUPLING models”
Today’s pilot validated the Opus-orchestrates-Sonnet-subagent operating model (memo: feedback_happiness_pipeline_model_tier_split_2026-06-06). Two Ch1+Ch2 Sonnet dispatches added ~62k Sonnet tokens to the weekly bucket; they are already counted in the 72% envelope but rolled into a single aggregate number. Without a coupling view, there is no signal that “the right kind of work is being done by the right model”; the only visible figure is total spend.
The metric becomes a leading indicator: as coupling rises, Opus tokens stretch further for the same output volume. It’s the operational mirror of feedback_compounding_kb_is_the_kernel applied to model spend.
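The "stretch" claim can be made concrete with a small worked calculation, under two loudly stated assumptions: coupling is the Sonnet share of total tokens, and output volume scales with total tokens. If coupling is c, Opus supplies a fraction (1 - c) of the tokens, so each Opus token backs 1 / (1 - c) tokens of total work. The `opus_stretch` helper is hypothetical and ignores any Opus overhead spent orchestrating or reviewing subagent output.

```python
def opus_stretch(coupling: float) -> float:
    """Total tokens of work backed by one Opus token, given a Sonnet share.

    Assumes coupling = sonnet_tokens / total_tokens, so the Opus share is
    (1 - coupling) and the stretch factor is its reciprocal.
    """
    if not 0.0 <= coupling < 1.0:
        raise ValueError("coupling must be in [0, 1)")
    return 1.0 / (1.0 - coupling)


# Illustrative coupling levels, not measured values.
for c in (0.0, 0.25, 0.5, 0.75):
    print(f"coupling {c:.2f}: stretch factor {opus_stretch(c):.2f}")
```

The relationship is non-linear: moving from 0.50 to 0.75 coupling doubles the stretch factor, which is why the ratio works as a forward indicator and not only as a descriptive one.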
Metric definition: