Software Factory paradigm: no-human-review coding with scenario-based validation¶


Source type	Practitioner
Confidence	Medium
Last verified	2026-03-20
Topics	`claude-code` `ai-assisted-design`

Insight: StrongDM's AI team operates a "Software Factory" where code is neither written nor reviewed by humans. Specs and scenarios drive coding agents, with validation through "satisfaction" metrics (probabilistic success rates across observed trajectories) rather than binary test passes. Their benchmark: "$1,000/day per engineer in token costs" indicates the economic threshold for this approach.

Detail: Key innovations include: (1) Scenario testing as "holdout sets" stored outside the codebase so agents can't cheat; (2) Digital Twin Universe — behavioral clones of third-party services (Okta, Jira, Slack) built by agents for unlimited testing without rate limits; (3) Attractor, their coding agent released as spec-only markdown (no code); (4) Novel terminology: "Gene Transfusion" (extracting patterns between systems), "Semports" (cross-language porting), "Pyramid Summaries" (multi-level summaries for agent context). Founded July 2025, the team identified Claude Sonnet 3.5 (Oct 2024) as the inflection point where "long-horizon agentic coding workflows began to compound correctness rather than error."

Sources

Simon Willison — "How StrongDM's AI team build serious software without even looking at the code" (2026-02-07)

Related: existing entry "Agentic coding workflow maturity levels" in external/claude-code.md — COMPLEMENTS